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(57) Abstract 



A biosynthetic method for producing vitamin C (ascorbic acid, L-ascorbic acid, or AA) is disclosed. Such a method includes 
fermentation of a genetically modified microorganism or plant to produce L-ascorbic acid. In particular, the present invention relates 
to the use of microorganisms and plants having at least one genetic modification to increase the action of an enzyme involved in 
the ascorbic acid biosynthetic pathway. Included is the use of nucleotide sequences encoding epimerases, including the endogenous 
GDP-D~mannose:GDP-L-galactose epimerase from the L-ascorbic acid pathway and homologues thereof for the purposes of improving 
the biosynthetic production of ascorbic acid. The present invention also relates to genetically modified microorganisms, such as strains of 
microalgae, bacteria and yeast useful for producing L-ascorbic acid, and to genetically modified plants, useful for producing consumable 
plant food products. 
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VITAMIN C PRODUCTION IN MICROORGANISMS AND PLANTS 



FIELD OF THE INVENTION 
The present invention relates to vitamin C (L-ascorbic acid) production using 
genetically modified microorganisms and plants. In particular, the present invention 
5 relates to the use of nucleotide sugar epimerase enzymes for the biological production of 
ascorbic acid in plants and microorganisms. 

BACKGROUND OF THE INVENTION 
Nearly all forms of life, both plant and animal, either synthesize ascorbic acid 
(vitamin C) or require it as a nutrient. Ascorbic acid was first identified to be useful as a 

10 dietary supplement for humans and animals for the prevention of scurvy. Ascorbic acid, 
however, also affects human physiological functions such as the adsorption of iron, cold 
tolerance, the maintenance of the adrenal cortex, wound healing, the synthesis of 
polysaccharides and collagen, the formation of cartilage, dentine, bone and teeth, the 
maintenance of capillaries, and is useful as an antioxidant. 

15 For use as a dietary supplement, ascorbic acid can be isolated from natural 

sources, such as rosehips, synthesized chemically through the oxidation of L-sorbose, or 
produced by the oxidative fermentation of calcium D-gluconate by Acetobacter 
subaxidans. Considine, "Ascorbic Acid," Van Nostrarufs Scientific Encyclopedia, Vol. 
1, pp. 237-238, (1989). Ascorbic acid (predominantly intracellular) has also been 

20 obtained through the fermentation of strains of the microalga, Chlorella pyrenoidosa See 
U.S. Patent No. 5,001,059 by Skatrud, which is assigned to the assignee of the present 
application. It is believed that ascorbic acid is produced inside the chloroplasts of 
photosynthetic microorganisms and functions to neutralize energetic electrons produced 
during photosynthesis. Accordingly, ascorbic acid production is known in photosynthetic 

25 organisms as a protective mechanism. 

Therefore, products and processes which improve the ability to biosynthetically 
produce ascorbic acid are desirable and beneficial for the improvement of human health. 

SUMMARY OF THE INVENTION 
One embodiment of the present invention relates to a method for producing 
30 ascorbic acid or esters thereof in a microorganism. The method includes the steps of: (a) 
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culturing a microorganism having a genetic modification to increase the action of an 
enzyme selected from the group of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, 
GDP-D-mannose: GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
5 galactose- 1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-y-lactone 
dehydrogenase; and (b) recovering the ascorbic acid or esters produced by the 
microorganism. Preferably, the genetic modification is a genetic modification to increase 
the action of an enzyme selected from the group of GDP-D-mannose: GDP-L-galactose 
epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P-phosphatase, L-galactose 

10 dehydrogenase, and/or L-galactono-y-lactone dehydrogenase. In one embodiment of the 
method of the present invention, the microorganism further includes a genetic 
modification to decrease the action of an enzyme having GDP-D-mannose as a substrate, 
other than GDP-D-mannose: GDP-L-galactose epimerase. Such a genetic modification 
can include, for example, a genetic modification to decrease the action of GDP-D- 

15 mannose-dehydrogenase. 

In one embodiment, the genetic modification is a genetic modification to increase 
the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, which can include GDP-D-mannose: GDP-L-galactose epimerase. In one 
embodiment, the epimerase binds NADPH. In one embodiment of this method, the 

20 genetic modification includes transformation of the microorganism with a recombinant 
nucleic acid molecule that expresses the epimerase. Such an epimerase can have a tertiary 
structure that substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase represented by atomic coordinates having Brookhaven 
Protein Data Bank Accession Code lbws. Preferably, the epimerase has a structure 

25 having an average root mean square deviation of less than about 2.5 A, and more 
preferably less than about 1 A, over at least about 25% of Co positions of the tertiary 
structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by 
atomic coordinates having Brookhaven Protein Data Bank Accession Code lbws. 

In one embodiment, the epimerase comprises a substrate binding site having a 

30 tertiary structure that substantially conforms to the tertiary structure of the substrate 
binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by 
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atomic coordinates having Brookhaven Protein Data Bank Accession Code lbws. Such 
a substrate binding site preferably has a tertiary structure with an average root mean 
square deviation of less than about 2.5 A over at least about 25% of Ca positions of the 
tertiary structure of a substrate binding site of a GDP-4-keto-6-deoxy-D-mannose 
5 epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code lbws. 

In another embodiment, the epimerase comprises a catalytic site having a tertiary 
structure that substantially conforms to the tertiary structure of the catalytic site of a 
GDP^-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates 

10 having Brookhaven Protein Data Bank Accession Code lbws. Such a catalytic she 
preferably has a tertiary structure with an average root mean square deviation of less than 
about 1 A over at least about 25% of Ca positions of the tertiary structure of a catalytic 
site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic 
coordinates having Brookhaven Protein Data Bank Accession Code lbws. The catalytic 

15 site preferably includes the amino acid residues serine, tyrosine and lysine and in one 
embodiment, the tertiary structure positions of the amino acid residues serine, tyrosine and 
lysine substantially conform to tertiary structure positions of residues Serl07, Tyrl36 and 
Lysl40, respectively, as represented by atomic coordinates in Brookhaven Protein Data 
Bank Accession Code lbws, 

20 In yet another embodiment of this method, the epimerase comprises an amino acid 

sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 
amino acid residues in the amino acid sequence align with 100% identity with at least 
about 50%, and in another embodiment with at least about 75%, and in yet another 
embodiment with at least about 90% of non-Xaa residues in SEQ ID NO: 11. In another 

25 embodiment, the epimerase comprises an amino acid sequence having at least 4 
contiguous amino acid residues that are 100% identical to at least 4 contiguous amino acid 
residues of an amino acid sequence selected from the group of SEQ ID NO: 2, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO: 10. In yet another embodiment, 
the recombinant nucleic acid molecule comprises a nucleic acid sequence comprising at 

30 least about 12 contiguous nucleotides having 100% identity with at least about 12 
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contiguous nucleotides of a nucleic acid sequence selected from the group of SEQ ID 
NO: 1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9. 

In yet another embodiment of this method of the present invention, the epimerase 
comprises an amino acid sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly. In 
5 yet another embodiment, the recombinant nucleic acid molecule comprises a nucleic acid 
sequence that is at least about 15% identical, and in another embodiment, at least about 
20% identical, and in another embodiment, at least about 25% identical, to a nucleic acid 
sequence selected from the group of SEQ ID NO 1, SEQIDNO:3, SEQIDNO:5, SEQ 
ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method with Lipman- 

10 Pearson standard default parameters. 

In yet another embodiment of this method of the present invention, the 
recombinant nucleic acid molecule comprises a nucleic acid sequence that hybridizes under 
stringent hybridization conditions to a nucleic acid sequence encoding a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase. The nucleic acid sequence encoding the GDP-4- 

15 keto-6-deoxy-D-mannose epimerase/reductase includes nucleic acid sequences selected 
from the group of SEQ ID NO: 1, SEQ ID N03 and SEQ ID NO : 5, and the GDP-4-keto- 
6-deoxy-D-mannose epimerase/reductase can include an amino acid sequence selected 
from the group of SEQ ID NO:2, SEQ ID NO:4 and SEQ TD NO:6. 

In one embodiment of the method of the present invention, the microorganism is 

20 selected from the group of bacteria, fungi and microalgae. In one embodiment, the 
microorganism is acid-tolerant. Preferred bacteria include, but are not limited to 
Azotobacter and Pseudomonas. Preferred fungi include, but are not limited to, yeast, 
including, but not limited to Saccharomyces yeast. Preferred microalgae include, but are 
not limited to, microalgae of the genera Prototheca and Chlorella, with microalgae of the 

25 genus Prototheca being particularly preferred. 

In yet another embodiment of the method of the present invention, the 
microorganism is acid-tolerant and the step of cuhuring is conducted at a pH of less than 
about 6.0, and more preferably, at a pH of less than about 5.5, and even more preferably, 
at a pH of less than about 5.0. The step of culturing can be conducted in a fermentation 

30 medium that comprises a carbon source other than D-mannose in one embodiment, and 
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in another embodiment, the step of culturing is conducted in a fermentation medium that 
comprises glucose as a carbon source. 

In yet another embodiment of the present method, the step of culturing is 
conducted in a fermentation medium that is magnesium (Mg) limited. Preferably, the step 
5 of culturing is conducted in a fermentation medium that is Mg limited during a cell growth 
phase. In one embodiment, the fermentation medium includes less than about 0.5 g/L of 
Mg during a cell growth phase, and more preferably, less than about 0.2 g/L of Mg during 
a cell growth phase, and even more preferably, less than about 0. 1 g/L of Mg during a cell 
growth phase. 

10 Another embodiment of the present invention relates to a microorganism for 

producing ascorbic acid or esters thereof The microorganism has a genetic modification 
to increase the action of an enzyme selected from the group of hexokinase, glucose 
phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D- 
mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L- 

15 galactose phosphorylase, L-galactose- 1-P-phosphatase, L-galactose dehydrogenase, 
and/or L-galactono- y -lactone dehydrogenase. Preferably, the genetic modification is a 
genetic modification to increase the action of an enzyme selected from the group of GDP- 
D-mannose: GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose- 1- 
P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-y -lactone dehydrogenase, 

20 and even more preferably, to increase the action of GDP-D-mannose: GDP-L-galactose 
epimerase. 

In one embodiment, the microorganism has been genetically modified to express 
a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of 
GDP-D-mannose to GDP-L-galactose, wherein the epimerase has a tertiary structure 

25 having an average root mean square deviation of less than about 2.5 A over at least about 
25% of Cot positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code lbws. In another embodiment, the microorganism has been 
genetically modified to express a recombinant nucleic acid molecule encoding an 

30 epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein the 
epimerase comprises an amino acid sequence that aligns with SEQ ID NO: 1 1 using a 
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CLUSTAL alignment program, wherein amino acid residues in the amino acid sequence 
align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO: 1 1 . 
Preferred microorganisms are disclosed as for the method discussed above. 

Yet another embodiment of the present invention relates to a plant for producing 
5 ascorbic acid or esters thereof. Such a plant has a genetic modification to increase the 
action of an enzyme selected from the group of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, 
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
galactose-l-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-y -lactone 

10 dehydrogenase. In a preferred embodiment, the genetic modification is a genetic 
modification to increase the action of an enzyme selected from the group of GDP-D- 
mannose: GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P- 
phosphatase, L-galactose dehydrogenase, and/or L-galactono-y -lactone dehydrogenase, 
and in a more preferred embodiment, the genetic modification is a genetic modification 

15 to increase the action of GDP-D-mannose: GDP-L-galactose epimerase. 

In one embodiment, the plant further comprises a genetic modification to decrease 
the action of an enzyme having GDP-D-mannose as a substrate other than GDP-D- 
mannose: GDP-L-galactose epimerase. Such a genetic modification includes a genetic 
modification to decrease the action of GDP-D-mannose-dehydrogenase. Such a plant also 

20 includes a plant that has been genetically modified to express a recombinant nucleic acid 
molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, wherein the epimerase has a tertiary structure having an average root mean 
square deviation of less than about 2.5 A over at least about 25% of Cct positions of the 
tertiary structure of a GDP-4-keto-6-d eoxy-D-manno se epimerase/reductase represented 

25 by atomic coordinates having Brookhaven Protein Data Bank Accession Code lbws. In 
another embodiment, such a plant has been genetically modified to express a recombinant 
nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D- 
mannose to GDP-L-galactose, wherein the epimerase comprises an amino acid sequence 
that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein amino 

3 0 acid residues in the amino acid sequence align with 1 00% identity with at least about 50% 
of non-Xaa residues in SEQ ID NO: 1 1 . 
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In one embodiment, a plant for producing ascorbic acid or esters thereof according 
to the present invention is a microalga. Preferred microalgae include, but are not limited 
to microalgae of the genera Prototheca and CMorella, with microalga of the genus 
Prototheca being particularly preferred. In another embodiment, the plant is a higher 
5 plant, with consumable higher plants being more preferred. 



BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 A is a schematic drawing of the pathway from glucose to GDP-D-mannose 
in plants. 

Fig. IB is a schematic drawing of the pathway from GDP-D-mannose to L- 
1 0 galactose- 1 -phosphate in plants. 

Fig. 1C is a schematic drawing of the pathway from L-galactose to L-ascorbic acid 
in plants. 

Fig. 2A is a schematic drawing of selected carbon flow from glucose in 
Prototheca. 

15 Fig. 2B is a schematic drawing of selected carbon flow from glucose in 

Prototheca. 

Fig. 3 is a schematic drawing that shows the lineage of mutants derived from 
Prototheca moriformis ATCC 75669, and their ability to produce L-ascorbic acid. 

Fig. 4 is a bar graph illustrating the conversion of substrates by resting cells of 
20 strain NA45-3 following growth in media containing various magnesium concentrations 
and resuspension in media containing various magnesium concentrations. 

Fig. 5 is a line graph showing the relationship between specific ascorbic acid 
formation in cultures of Prototheca strains and the specific activity of GDP-D- 
mannose: GDP-L-galactose epimerase in extracts prepared from cells harvested from the 
25 same cultures. 

Fig. 6 is a line graph showing the relationship between specific epimerase activity 
and the degree of magnesium limitation in two strains, ATCC 75669 and EMS 13-4. 

Fig. 7 depicts the overall catalytic mechanism of GDP-D-mannose:GDP-L- 
galactose epimerase proposed by Barber (1979, J. Biol Chem. 254:7600-7603). 
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Fig. 8A depicts the catalytic mechanism of GDP-D-mannose-4, 6-dehydratase 
(converts GDP-D-mannose to GDP-4-keto-6-deoxy-D-mannose). 

Fig. 8B depicts the catalytic mechanism of GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (converts GDP-4-keto-6-deoxy-D-mannose to GDP-L-fucose) 
5 (Chang, etal., \9W,J.BioL Chem. 263:1693-1697; Barber, 1980, Plant Physiol. 66:326- 
329). 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to a biosynthetic method and production 
microorganisms and plants for producing vitamin C (ascorbic acid, L-ascorbic acid, or 

10 AA). Such a method includes fermentation of a genetically modified microorganism to 
produce L-ascorbic acid. In particular, the present invention relates to the use of 
nucleotide sequences encoding epimerases, including the endogenous GDP-D- 
mannose: GDP-I .-galactose epimerase from the L-ascorbic acid pathway, as well as 
epimerases having structural homology (e.g., by nucleotide/amino acid sequence and/or 

1 5 tertiary structure of the encoded protein) to GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductases, or UDP-galactose 4-epimerases, for the purposes of improving the 
biosynthetic production of ascorbic acid. The present invention also relates to genetically 
modified microorganisms, such as strains of microalgae, bacteria and yeast useful for 
producing L-ascorbic acid, and to genetically modified plants, useful for producing 

20 consumable plant food products. 

One embodiment of the present invention relates to a method to produce L- 
ascorbic acid by fermentation of a genetically modified microorganism. This method 
includes the steps of (a) culturing in a fermentation medium a microorganism having a 
genetic modification to increase the action of an enzyme selected from the group of 

25 hexokinase, glucose phosphate isomerase, phosphomannose isomerase, 
phosphomannomutase, GDP-mannose pyrophosphorylase, GDP-D-mannose :GDP-L- 
galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-l-P-phosphatase, L- 
galactose dehydrogenase, and L-galactono-y-lactone dehydrogenase; and (b) recovering 
L-ascorbic acid or esters thereof. The various enzymes in this list represent the enzymes 

30 involved in the vitamin C biosynthetic pathway in plants. It is uncertain at this time 
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whether the enzyme represented by GDP-L-galactose phosphorylase is actually a 
phosphorylase or a pyrophosphorylase (i.e., GDP-L-galactose pyrophosphorylase). 
Therefore, use of the term "GDP-L-galactose phosphorylase" herein refers to either GDP- 
L-galactose phosphorylase or GDP-L-galactose pyrophosphorylase. In one aspect of the 
5 invention, this method includes the step of cuhuring in a fermentation medium a 
microorganism having a genetic modification to increase the action of an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose. This aspect of the present 
invention is discussed in detail below. 

Another embodiment of the present invention relates to a genetically modified 

10 microorganism for producing L-ascorbic acid or esters thereof Another embodiment of 
the present invention relates to a genetically modified plant for producing L-ascorbic acid 
or esters thereof. Both genetically modified microorganisms (e.g., bacteria, yeast, 
microalgae) and plants (e.g., higher plants, microalgae) have a genetic modification to 
increase the action of an enzyme selected from the group of hexokinase, glucose 

1 5 phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-mannose 
pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1 -P-phosphatase, L-galactose dehydrogenase, and/or L- 
galactono-y-lactone dehydrogenase. In a preferred embodiment, both genetically 
modified microorganisms (e.g., bacteria, yeast, microalgae) and plants (e.g., higher plants, 

20 microalgae) have a genetic modification to increase the action of an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose. In one embodiment, the 
genetic modification includes the transformation of the microorganism or plant with the 
epimerase as described above. 

To produce significantly high yields of L-ascorbic acid by the method of the 

25 present invention, a plant and/or microorganism is genetically modified to enhance 
production of L-ascorbic acid. As used herein, a genetically modified plant (such as a 
higher plant or microalgae) or microorganism, such as a microalga (Prototheca, 
Chloretta\ Escherichia coli, or a yeast, is modified (i.e., mutated or changed) within its 
genome and/or by recombinant technology (i.e., genetic engineering) from its normal (i.e., 

30 wild-type or naturally occurring) form. In a preferred embodiment, a genetically modified 
plant or microorganism according to the present invention has been modified by 
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recombinant technology. Genetic modification of a plant or microorganism can be 
accomplished using classical strain development and/or molecular genetic techniques, 
include genetic engineering techniques. Such techniques are generally disclosed herein 
and are additionally disclosed, for example, in Sambrook et al, 1989, Molecular Cloning: 
5 A Laboratory Manual, Cold Spring Harbor Labs Press; Roessler, 1995, Plant Lipid 
Metabolism, pp. 46-48; and Roessler et al., 1994, in Bioconversion for Fuels, Himmel et 
al eds., American Chemical Society, Washington D C, pp 255-70). These references are 
incorporated by reference herein in their entirety. 

In some embodiments, a genetically modified plant or microorganism can include 

10 a natural genetic variant as well as a plant or microorganism in which nucleic acid 
molecules have been inserted, deleted or modified, including by mutation of endogenous 
genes (e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such 
a manner that the modifications provide the desired effect within the plant or 
microorganism. As discussed above, a genetically modified plant or microorganism 

1 5 includes a plant or microorganism that has been modified using recombinant technology. 

As used herein, genetic modifications which result in a decrease in gene 
expression, an increase in inhibition of gene expression or inhibition of a gene product 
(i.e., the protein encoded by the gene), a decrease in the function of the gene, or a 
decrease in the function of the gene product can be referred to as inactivation (complete 

20 or partial), deletion, interruption, blockage, down-regulation, or decreased action of a 
gene. For example, a genetic modification in a gene which results in a decrease in the 
function of the protein encoded by such gene can be the result of a complete deletion of 
the gene encoding the protein (i.e., the gene does not exist, and therefore the protein does 
not exist), a mutation in the gene encoding the protein which results in incomplete or no 

25 translation of the protein (e.g., the protein is not expressed), or a mutation in the gene 
which decreases or abolishes the natural function of the protein (e.g., a protein is 
expressed which has decreased or no enzymatic activity). 

Genetic modifications which result in an increase in gene expression or function 
can be referred to as amplification, overproduction, overexpression, activation, 

30 enhancement, addition, up-regulation or increased action of a gene. Additionally, a genetic 
modification to a gene which modifies the expression, function, or activity of the gene can 
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have an impact on the action of other genes and their expression products within a given 
metabolic pathway (e.g., by inhibition or competition). In this embodiment, the action 
(e.g., activity) of a particular gene and/or its product can be affected (i.e., upregulated or 
downregulated) by a genetic modification to another gene within the same metabolic 

5 pathway, or to a gene within a different metabolic pathway which impacts the pathway of 
interest by competition, inhibition, substrate formation, etc. 

In general, a plant or microorganism having a genetic modification that affects L- 
ascorbic acid production has at least one genetic modification, as discussed above, which 
results in a change in the L-ascorbic acid production pathway as compared to a wild-type 

0 plant or microorganism grown or cultured under the same conditions. Such a 
modification in an L-ascorbic acid production pathway changes the ability of the plant or 
microorganism to produce L-ascorbic acid. According to the present invention, a 
genetically modified plant or microorganism preferably has an enhanced ability to produce 
L-ascorbic acid compared to a wild-type plant or microorganism cultured under the same 

5 conditions. 

The present invention is based on the present inventors' discovery of the 
biosynthetic pathway for L-ascorbic acid (vitamin C) in plants and microorganisms. Prior 
to the present invention, the metabolic pathway by which plants produce L-ascorbic acid, 
was not completely elucidated. The present inventors have demonstrated that L-ascorbic 

0 acid production in plants, including L-ascorbic acid-producing microorganisms (e.g., 
microalgae), is a pathway which uses GDP-D-mannose and involves sugar phosphates and 
NDP-sugars. In addition, the present inventors have made the surprising discovery that 
both L-galactose and L-galactono-y -lactone can be rapidly converted into L-ascorbic acid 
in L-ascorbic acid-producing microalgae, including Prototheca and Chlorelh 

5 pyrenoidosa. The entire pathway for L-ascorbic acid production in plants is set forth in 
Figs. 1A-1C. More particularly, Fig. 1A shows that the production of L-ascorbic acid in 
plants proceeds through the production of mannose intermediates to GDP-D-mannose, 
followed by the conversion of GDP-D-mannose to GDP-L-galactose by GDP-D- 
mannose:GDP-L-galactose epimerase (also known as GDP-D-mannose- 3 , 5-epimerase) 

0 (Fig. IB), and then by the subsequent progression to L-galactose- 1-P, L-galactose, L- 
galactonic acid (optional), L-galactono-y -lactone, and L-ascorbic acid (Fig. 1C). Fig. IB 
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also illustrates alternate pathways for the use of various intermediates, such as GDP-D- 
mannose. Certain aspects of this pathway have been independently described in a 
publication (Wheeler, et ah, 1998, Nature 393:365-369), incorporated herein by reference 
in its entirety. 

5 Points within the L-ascorbic acid production pathway which can be targeted by 

genetic modification to affect the production of L-ascorbic acid can generally be 
catagorized into at least one of the following pathways: (a) pathways affecting the 
production of GDP-D-mannose (e.g., pathways for converting a carbon source into GDP- 
D-mannose); (b) pathways for converting GDP-D-mannose into other compounds, (c) 

0 pathways associated with or downstream of the action of GDP-D-mannose: GDP-L- 
galactose epirnerase, (d) pathways which compete for substrates involved in the 
production of any of the intermediates within the L-ascorbic acid production pathway, and 
in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1 -phosphate, L- 
galactose, L-galactono- y -lactone, and/or L-ascorbic acid; and (e) pathways which inhibit 

5 production of any of the intermediates within the L-ascorbic acid production pathway, and 
in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1 -phosphate, L- 
galactose, L-galactono-y -lactone, and/or L-ascorbic acid. 

A genetically modified plant or microorganism useful in a method of the present 
invention typically has at least one genetic modification in the L-ascorbic acid production 

3 pathway which results in an enhanced production of L-ascorbic acid. In one embodiment, 
a genetically modified plant or microorganism has at least one genetic modification that 
results in: (a) an enhanced production of GDP-D-mannose; (b) an inhibition of pathways 
which convert GDP-D-mannose into compounds other than GDP-L-galactose; (c) an 
enhancement of action of the GDP-D-mannose: GDP-L-galactose epirnerase; (d) an 

5 enhancement of the action of enzymes downstream of the GDP-D-mannose : GDP-L- 
galactose epirnerase; (e) an inhibition of pathways which compete for substrates involved 
in the production of any of the intermediates within the L-ascorbic acid production 
pathway, and in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1- 
phosphate, L-galactose, L-galactono-y-lactone, and/or L-ascorbic acid; and (e) an 

3 inhibition of pathways which inhibit production of any of the intermediates within the L- 
ascorbic acid production pathway, and in particular, with GDP-D-mannose, GDP-L- 
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galactose, L-galactose-1 -phosphate, L-galactose, L-galactono-y -lactone, and/or L- 
ascorbic acid. 

An enhanced production of GDP-D-mannose by genetic modification of the plant 
or microorganism can be achieved by, for example, overexpression of enzymes such as 

5 hexokinase, glucose phosphate isomerase, phosphomannose isomerase (PMI), 
phosphomannomutase (PMM) and/or GDP-D-mannose pyrophosphorylase (GMP) 
Inhibition of pathways which convert GDP-D-mannose to compounds other than GDP-L- 
galactose can be achieved, for example, by modifications which inhibit polysaccharide 
synthesis, GDP-D-rhamnose synthesis, GDP-L-fucose synthesis and/or GDP-D- 

0 mannuronic acid synthesis. An increase in the action of the GDP-D-mannose:GDP-L- 
galactose epimerase and of enzymes downstream of the epimerase in the L-ascorbic acid 
production pathway can be achieved by genetic modifications which include, but are not 
limited to: overexpression of the epimerase gene (i.e, by overexpression of a recombinant 
nucleic acid molecule encoding the epimerase gene or a homologue thereof (discussed in 

5 detail below), and/or by mutation of the endogenous or recombinant gene to enhance 
expression of the gene) and/or overexpression of genes downstream of the epimerase 
which encode subsequent enzymes in the L-ascorbic acid pathway. Finally, metabolic 
pathways which compete with or inhibit the L-ascorbic acid production pathway can be 
inhibited by deleting or mutating enzymes, substrates or products which either inhibit or 

D compete for an enzyme, substrate or product in the L-ascorbic acid pathway. 

As discussed above, a genetically modified plant or microorganism useful in the 
method of the present invention can have at least one genetic modification (e.g., mutation 
in the endogenous gene or addition of a recombinant gene) in a gene encoding an enzyme 
involved in the L-ascorbic acid production pathway. Such genetic modifications 

5 preferably increase (i.e., enhance) the action of such enzymes such that L-ascorbic acid 
is preferentially produced as compared to other possible end products in related metabolic 
pathways. Such genetic modifications include, but are not limited to, overexpression of 
the gene encoding such enzyme, and deletion, mutation, or downregulation of genes 
encoding competitors or inhibitors of such enzyme. Preferred enzymes for which the 

) action of the gene encoding such enzyme can be genetically modified include: hexokinase, 
glucose phosphate isomerase, phosphomannose isomerase (PMI), phosphomannomutase 
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(PMM), GDP-D-mannose pyrophosphorydase (GMP), GDP-D-mannose:GDP-L-galactose 
epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P-phosphatase, L-galactose 
dehydrogenase, and/or L-galactono-y-lactone dehydrogenase. More preferably, a 
genetically modified plant or microorganism useful in the present invention has a genetic 
5 modification which increases the action of an enzyme selected from the group of GDP-D- 
mannose: GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-l-P- 
phosphatase, L-galactose dehydrogenase, and/or L-galactono-y-lactone dehydrogenase. 
Even more preferably, a genetically modified plant or microorganism useful in the present 
invention has a genetic modification which increases the action of GDP-D-mannose: GDP- 
10 L-galactose epimerase. These enzymes and the reactions catalyzed by such enzymes are 
illustrated in Figs. 1A-1C. 

Prior to the present invention, without knowing the L-ascorbic acid biosynthetic 
(i.e., production) pathway, previous mutagenesis and screening efforts were limited in that 
only non-lethal mutations could be detected. One embodiment of the present invention 
1 5 relates to elimination of a key competing enzyme that diverts carbon flow from L-ascorbic 
acid synthesis. If such enzyme is absolutely required for growth on glucose, then mutants 
lacking the enzyme (and, therefore, having increased carbon flow to L-ascorbic acid) 
would have been nonviable and not have been detected during prior screening efforts. 
One such enzyme is phosphofructokinase (PFK) (See Fig. 2A). PFK is required for 
20 growth on glucose, and is the major step drawing carbon away from L-ascorbic acid 
biosynthesis (Fig. 2A). Elimination of PFK would render the cells nonviable on glucose- 
based media. Selection of a conditional mutant where PFK was inactivated by temperature 
shift, for example, may allow development of a L-ascorbic acid process where cell growth 
is achieved under permissive fermentation conditions, and L-ascorbic acid production 
(from glucose) is initiated by a shift to non-permissive condition. In this example, the 
temperature shift would elirninate carbon flow from glucose to glycolysis via PFK, thereby 
shunting carbon into the L-ascorbic acid branch of metabolism. This approach has 
application not only in natural L-ascorbic acid producing organisms, but also in L-ascorbic 
acid recombinant systems (genetically engineered plant or microorganisms) as discussed 
herein. 



WO 99/646 1 8 PCT7US99/1 1 576 

15 

Knowing the identity and mechanism of the rate-limiting pathway enzymes in the 
L-ascorbic acid production pathway allows for design of specific inhibitors of the enzymes 
that are also growth inhibitory. Selection of mutants resistant to the inhibitors allows for 
the isolation of strains that contain L-ascorbic acid-pathway enzymes with more favorable 

5 kinetic properties. Therefore, one embodiment of the present invention is to identify 
inhibitors of the enzymes that are also growth inhibitory. These inhibitors are then used 
to select genetic mutants that overcome this inhibition and produce L-ascorbic acid at high 
levels. In this embodiment, the resultant plant or microorganism is a non-recombinant 
strain which can then be further modified by recombinant technology, if desired. In 

0 recombinant L-ascorbic acid producing strains, random mutagenesis and screening can be 
used as a final step to increase L-ascorbic acid production. 

In yet another embodiment genetic modifications are made to an L-ascorbic acid 
producing organism directly. This allows one to build upon a base of data acquired during 
prior classical strain improvement efforts, and perhaps more importantly, allows one to 

5 take advantage of undefined beneficial mutations that occurred during classical strain 
improvement. Furthermore, fewer problems are encountered when expressing native, 
rather than heterologous, genes. The most advanced system for development of genetic 
systems for microalgae has been developed for Chlamydomonas reinhardtii. Preferably, 
development of such a genetically modified production organism would include: isolation 

0 of mutant(s) with a specific nutritional requirement for use with a cloned selectable marker 
gene (similar to the ura3 mutants used in yeast and fungal systems); a cloned selectable 
marker such as URA3 or alternatively, identification and cloning of a gene that specifies 
resistance to a toxic compound (this would be analogous to the use of antibiotic resistance 
genes in bacterial systems, and, as is the case in yeast and other fimgi, a means of 

5 inserting/removing the marker gene repeatedly would be required, unless several different 
selectable markers were developed); a transformation system for introducing DNA into 
the production organism and achieving stable transformation and expression; and, a 
promoter system (preferably several) for high-level expression of cloned genes in the 
organism. 

0 Another embodiment of the present invention, discussed in detail below, is to place 

key genes or allelic variants and homologues thereof from L-ascorbic acid producing 
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organisms (i.e., higher plants and microalgae) into a plant or microorganism that is more 
amenable to molecular genetic manipulation, including endogenous L-ascorbic acid 
producing microorganisms and suitable plants. For example, it is possible to identify a 
suitable non-pathogenic organism based on the requirement of growth (on glucose) at low 

5 pH (i.e., acid-tolerant organisms, discussed in detail below). 

One suitable candidate for recombinant production in any suitable host organism 
is the gene (nucleic acid molecule) encoding GDP-D-manno se : GDP-L-galactose 
epirnerase and homologues of the GDP-D-mannose:GDP-L-galactose epimerase, as well 
as any other epimerase that has structural homology at the primary (i.e., sequence) or 

0 tertiary (i.e., three dimensional) level, to a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase, or to a UDP-galactose 4 -epimerase. Many microorganisms produce GDP-D- 
manno se as a precursor to exopolysaccharide and glycoprotein production, even though 
such organisms may not make L-ascorbic acid. This aspect of the present invention is 
discussed in detail below. 

5 Referring to Figs. 1 A-1C, at least some of the enzymes from glucose-6-phosphate 

to GDP-D-mannose are present in many organisms. In feet, the entire sequence is present 
in bacteria such as Azotobacter vinelanchi and Pseudomonas aeruginosa, and make up the 
early steps in the biosynthesis of the exopolysaccharide alginate. In this regard, it is 
possible that the only thing preventing these organisms from producing L-ascorbic acid 

0 could be the lack of GDP-D-mannose: GDP -L-galactose epimerase The presence of PMI 7 
PMM and GMP (see Fig. 1 A) in so many organisms is important for two reasons. First, 
these organisms themselves could serve as alternate hosts for L-ascorbic acid production, 
by building on the existing early pathway enzymes and adding the required cloned genes 
(the epimerase and possibly others). Second, the genes encoding PMI, PMM and GMP 

5 can be cloned into a new organism where, together with the cloned epimerase, they would 
encode the overall pathway from ghicose-6-phosphate to GDP-L- galactose. 

In order to screen genomic DNA or cDNA libraries from different organisms and 
to isolate nucleic acid molecules encoding these enzymes such as the GDP-D- 
mannose:GDP-L-galactose epirnerase, one can use any of a variety of standard molecular 

0 and biochemical techniques. For example, the GDP-D-mannose: GDP-L-galactose 
epimerase can be purified from an organism such as Prototheca, the N -terminal amino 
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acid sequence can be determined (including, if necessary, the sequence of internal peptide 
fragments), and this information can be used to design degenerate primers for amplifying 
a gene fragment from the organism's DNA This fragment would then be used to probe 
the library, and subsequently fragments that hybridize to the probe would be cloned in that 
5 organism or another suitable production organism. There is ample precedent for plant 
enzymes being expressed in an active form in bacteria, such as E. coli. Alternatively, 
yeast are also a suitable candidate for developing a heterologous system for L-ascorbic 
acid production. 

It is to be understood that the present invention discloses a method comprising the 

0 use of a microorganism with an ability to produce commercially useful amounts of L- 
ascorbic acid in a fermentation process (i.e., preferably an enhanced ability to produce L- 
ascorbic acid compared to a wild-type microorganism cultured under the same 
conditions). This method is achieved by the genetic modification of one or more genes 
encoding a protein involved in an L-ascorbic acid pathway which results in the production 

5 (expression) of a protein having an altered (e.g., increased or decreased) function as 
compared to the corresponding wild-type protein. Preferably, such genetic modification 
is achieved by recombinant technology. It will be appreciated by those of skill in the art 
that production of genetically modified plants or microorganisms having a particular 
altered function as described elsewhere herein (e.g., an enhanced ability to produce GDP- 

0 D-mannose:GDP-L-galactose epimerase), such as by transformation of the plant or 
microorganism with a nucleic acid molecule which encodes a particular enzyme, can 
produce many organisms meeting the given functional requirement, albeit by virtue of a 
variety of different genetic modifications. For example, different random nucleotide 
deletions and/or substitutions in a given nucleic acid sequence may all give rise to the 

5 same phenorypic result (e.g., decreased enzymatic activity of the protein encoded by the 
sequence). The present invention contemplates any such genetic modification which 
results in the production of a plant or microorganism having the characteristics set forth 
herein. 

A microorganism to be used in the fermentation method of the present invention 
0 is preferably a bacterium, a fungus, or a microalga which has been genetically modified 
according to the disclosure above. More preferably, a microorganism useful in the present 
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invention is a microalga which is capable of producing L-ascorbic acid, although the 
present invention includes microorganisms which are genetically engineered to produce 
L-ascorbic acid using the knowledge of the key components of the pathway and the 
guidance provided herein. Even more preferably, a microorganism useful in the present 
invention is an acid-tolerant microorganism, such as microalgae of the genera Prototheca 
and Chlorella Acid-tolerant yeast and bacteria are also known in the art Acid-tolerant 
microorganisms are discussed in detail below. Particularly preferred microalgae include 
microalgae of the genera, Prototheca and Chlorella, with Prototheca being most 
preferred. All known species of Prototheca produce L-ascorbic acid. Production of 
ascorbic acid by microalgae of the genera Prototheca and Chlorella is described in detail 
in U.S. Patent No. 5,792,631, issued August 11, 1998, and in U.S. Patent No. 5,900,370, 
issued May 4, 1999, both of which are incorporated herein by reference in their entirety. 
Preferred bacteria for use in the present invention include, but are not limited to, 
Azotobacter, Pseudomonas, and Escherichia, although acid-tolerant bacteria are more 
preferred. Preferred fungi for use in the present invention include yeast, and more 
preferably, yeast of the genus, Saccharomyces. A microorganism for use in the 
fermentation method of the present invention can also be referred to as a production 
organism. According to the present invention, microalgae can be referred to herein either 
as microorganisms or as plants. 

A preferred plant to genetically modify according to the present invention is 
preferably a plant suitable for consumption by animals, including humans. More 
preferably, such a plant is a plant that naturally produces L-ascorbic acid, although other 
plants can be genetically modified to produce L-ascorbic acid using the guidance provided 
herein. 

The L-ascorbic acid production pathways of the microalgae Prototheca and 
Chlorella pyrenoiaosa will be addressed as specific embodiments of the present invention 
are described below. It will be appreciated that other plants and, in particular, other 
microorganisms, have similar L-ascorbic acid pathways and genes and proteins having 
similar structure and function within such pathways. It will also be appreciated that plants 
and microorganisms which do not naturally produce L-ascorbic acid can be modified 
according to the present invention to produce L-ascorbic acid. As such, the principles 
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discussed below with regard to Prototheca and ChloreJla pyrenoidosa are applicable to 
other plants and microorganisms, including genetically modified plants and 
microorganisms. 

In one embodiment of the present invention, the action of an enzyme in the L- 

5 ascorbic acid production pathway is increased by amplification of the expression (i.e., 
overexpression) of an enzyme in the pathway, and particularly, the GDP-D- 
mannose:GDP-L-galactose epimerase, homologues of the epimerase, and/or enzymes 
downstream of the epimerase. Overexpression of an enzyme can be accomplished, for 
example, by introduction of a recombinant nucleic acid molecule encoding the enzyme. 

0 It is preferred that the gene encoding an enzyme in the L-ascorbic acid production 
pathway be cloned under control of an artificial promoter. The promoter can be any 
suitable promoter that will provide a level of enzyme expression required to maintain a 
sufficient level of L-ascorbic acid in the production organism. Preferred promoters are 
constitutive (rather than inducible) promoters, since the need for addition of expensive 

5 inducers is therefore obviated. The gene dosage (copy number) of a recombinant nucleic 
acid molecule according to the present invention can be varied according to the 
requirements for maximum product formation. In one embodiment, the recombinant 
nucleic acid molecule encoding a gene in the L-ascorbic acid production pathway is 
integrated into the chromosomes of the microorganism. 

0 It is another embodiment of the present invention to provide a microorganism 

having one or more enzymes in the L-ascorbic acid production pathway with improved 
affinity for its substrates. An enzyme with improved affinity for its substrates can be 
produced by any suitable method of genetic modification or protein engineering. For 
example, computer-based protein engineering can be used to design an epimerase protein 

5 with greater stability and better affinity for its substrate. See for example, Maulik et al., 
1997, Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, 
Inc., which is incorporated herein by reference in its entirety. 

Recombinant nucleic acid molecules encoding proteins in the L-ascorbic acid 
production pathway can be modified to enhance or reduce the function (i.e., activity) of 

D the protein, as desired to increase L-ascorbic acid production, by any suitable method of 
genetic modification. For example, a recombinant nucleic acid molecule encoding an 
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enzyme can be modified by any method for inserting, deleting, and/or substituting 
nucleotides, such as by error-prone PCR In this method, the gene is amplified under 
conditions that lead to a high frequency of misincorporation errors by the DNA 
polymerase used for the amplification. As a result, a high frequency of mutations are 
5 obtained in the PCR products. The resulting gene mutants can then be screened for 
enhanced substrate affinity, enhanced enzymatic activity, or reduced/increased inhibitory 
ability by testing the mutant genes for the ability to confer increased L- ascorbic acid 
production onto a test microorganism, as compared to a microorganism carrying the non- 
mutated recombinant nucleic acid molecule. 
1 0 Another embodiment of the present invention includes a microorganism in which 

competitive side reactions are blocked, including all reactions for which GDP-D-mannose 
is a substrate other than the production of L-ascorbic acid. In a preferred embodiment, 
a microorganism having complete or partial inactivation (decrease in the action of) of 
genes encoding enzymes which compete with the GDP-D-mannose: GDP -L-galactose 
1 5 epimerase for the GDP-D-mannose substrate is provided. Such enzymes include GDP-D- 
mannase and/or GDP-D-mannose-dehydrogenase. As used herein, inactivation of a gene 
can refer to any modification of a gene which results in a decrease in the activity (i.e., 
expression or function) of such a gene, including attenuation of activity or complete 
deletion of activity. 

As discussed above, a particularly preferred aspect of the method to produce L- 
ascorbic acid by fermentation of a genetically modified microorganism of the present 
invention includes the step of cuhuring in a fermentation medium a microorganism having 
a genetic modification to increase the action of an epimerase that catalyzes conversion of 
GDP-D-mannose to GDP-L-galactose. According to the present invention, such an 
epimerase can include the endogenous GDP-D-mannose: GDP-L-galactose epimerase from 
the L-ascorbic acid pathway, described above, as well as any other epimerase that has 
structural homology at the primary (i.e., sequence) or tertiary (i.e., three dimensional) 
level, to a GDP-4-keto-6-deoxy-D-mannose epimerase/ reductase, or to a UDP-galactose 
4-epimerase. Such structural homology is discussed in detail below. Preferably, such an 
epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L- 
galactose. In one embodiment, the genetic modification includes transformation of the 
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microorganism with a recombinant nucleic acid molecule that expresses such an 
epimerase 

Therefore, the epimerase encompassed in the method and organisms of the 
present invention includes the endogenous epimerase which operates in the naturally 
5 occurring ascorbic acid biosynthetic pathway (referred to herein as GDP-D- 
mannose: GDP-L-galactose epimerase), GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductases, and any other epimerase which is capable of catalyzing the conversion of 
GDP-D mannose to GDP-L-galactose and which is structurally homologous to a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase. An 

1 0 epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose according 
the present invention can be identified by biochemical and functional characteristics as well 
as structural characteristics. For example, an epimerase according to the present invention 
is capable of acting on GDP-D-mannose as a substrate, and more particularly, such an 
epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L- 

1 5 galactose. It is to be understood that such capabilities need not necessarily be the normal 
or natural function of the epimerase as it acts in its endogenous (i.e., natural) environment. 
For example, GDP-4-keto-6-deoxy-D-mannose epimerase/reductase in its natural 
environment under normal conditions, catalyzes the conversion of GDP-D-mannose to 
GDP-L-fucose and does not act directly on GDP-D-mannose (See Fig. 8 A, B), however, 

20 such an epimerase is encompassed by the present invention for use in catalyzing the 
conversion of GDP-D-mannose to GDP-L-galactose for production of ascorbic acid, to 
the extent that it is capable o£ or can be modified to be capable o£ catalyzing the 
conversion of GDP-D-mannose to GDP-L-galactose. Therefore, the present invention 
includes epimerases which have the desired enzyme activity for use in production of 

25 ascorbic acid, are capable of having such desired enzyme activity, and/or are capable of 
being modified or induced to have such desired enzyme activity. 

In one embodiment, an epimerase according to the present invention includes an 
epimerase that catalyzes the reaction depicted in Fig. 7, In another embodiment, an 
epimerase according to the present invention includes an epimerase that catalyzes the first 

30 of the reactions depicted in Fig. 8B. In one embodiment, an epimerase according to the 
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present invention binds to NADPH. In another embodiment, an epimerase according to 
the present invention is NADPH-dependent for enzyme activity. 

As discussed above, the present inventors have discovered that a key enzyme in 
L-ascorbic acid biosynthesis in plants and microorganisms is GDP-D-mannose. GDP-L- 
5 galactose epimerase (refer to Figs. 1 A-1C). One embodiment of the invention described 
herein is directed to the manipulation of this enzyme and structural homologues of this 
enzyme to increase L-ascorbic acid production in genetically engineered plants and/or 
microorganisms. More particularly, the GDP-D-mannose:GDP-L-galactose epimerase of 
the l-ascorbic acid pathway and GDP-4-keto-6-deoxy-D-mannose epimerase/ reductases 

10 are believed to be structurally homologous at both the sequence and tertiary structure 
level; a GDP-4-keto-6-deoxy-D-mannose epimerase/ reductase is believed to be capable 
of functioning in the L-ascorbic acid biosynthetic pathway; and a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase or homologue thereof may be superior to a GDP-D- 
mannose-GDP-L-galactose epimerase for increasing L-ascorbic acid production in 

1 5 genetically engineered plants and/or microorganisms. Furthermore, the present inventors 
disclose the use of a nucleotide sequence encoding all or part of a GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase as a probe to identity the gene encoding GDP-D- 
mannose: GDP-L-galactose epimerase. Similarly, the present inventors disclose the use 
of a nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose 

20 epimerase/reductase to design oligonucleotide primers for use in a PCR-based strategy for 
identifying and cloning a gene encoding GDP-D-mannose: GDP-L-galactose epimerase. 

Without being bound by theory, the present inventors believe that the following 
evidence supports the novel concept that the GDP-D-mannose: GDP-L-galactose 
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases have significant 

25 structural homology at the level of sequence and/or tertiary structure, and that the GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductases and/or homologues thereof would be 
useful for production of ascorbic acid and/or for isolating the endogenous GDP-D- 
mannose: GDP-L-galactose epimerase. 

Although prior to the present invention, it was not known that the GDP-D- 

30 mannose: GDP-L-galactose epimerase enzyme (also known as GDP-D-mannose-3,5- 
epimerase) plays a critical role in L-ascorbic acid biosynthesis, this enzyme was previously 
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described to catalyze the overall reversible reaction between GDP-D-mannose and GDP- 
L-galactose (Barber, 1971, Arch. Biochem. Biophys. 147:619-623; Barber, 1975, Arch. 
Biochem. Biophys. 167:718-722; Barber, 1979, J. Biol. Chem. 254:7600-7603; Hebda, 
et al., 1979, Arch Biochem. Biophys. 194:496-502; Barber and Hebda, 1982, Meth. 
5 Enzymol, 83:522-525). Despite these studies, GDP-D-mannose GDP-L-galactose 
epimerase has never been well characterized nor has the gene encoding this enzyme been 
cloned and sequenced. Since the original work by Barber, GDP-D-mannose: GDP-L- 
galactose epimerase activity has been detected in the colorless microalga Prototheca 
moriformis by the assignee of the present application, and in Arabidopsis thaliana and pea 

10 embryonic axes (Wheeler, et al., 1998, ibid.). 

Barber (1979, J. Biol Chem. 254:7600-7603) proposed a mechanism for GDP-D- 
mannose:GDP-L-galactose epimerase partially purified from the green microalga 
Chlorella pyrenoidosa. The overall conversion of GDP-D-mannose to GDP-L-galactose 
was proposed to proceed by oxidation of the hexosyl moiety at CM to a keto intermediate, 

1 5 ene-diol formation, and inversion of the configurations at C-3 and C-5 upon rehydration 
of the double bonds and stereospecific reduction of the keto group. The proposed 
mechanism is depicted in Fig. 7. 

Based on Barber's work, Feingold and Avigad (1980, In The Biochemistry of 
Plants, Vol. 3: Carbohydrates; Structure and Function, P.K. Stompf and E.E. Conn, eds., 

20 Academic Press, NY) elaborated further on the proposed mechanism for GDP-D- 
mannose : GDP-L-galactose epimerase. This mechanism is based on the assumption that 
the epimerase contains tightly bound NAD', and transfer of a hydride ion from C-4 of the 
substrate (GDP-D-mannose) to enzyme-associated NAD + converts the enzyme to the 
reduced (NADH)form, generating enzyme-bound GDP-4-keto-D-mannose. The latter 

25 would then undergo epimerization by an ene-diol mechanism. The final product (GDP-L- 
galactose) would be released from the enzyme after stereospecific transfer of the hydride 
ion originally removed from C-4, simultaneously regenerating the oxidized form of the 
enzyme. 

L-fiicose (6-deoxy-L-galactose) is a component of bacterial lipopoly saccharides, 
30 mammalian and plant glycoproteins and polysaccharides of plant cell walls. L-fiicose is 
synthesized de novo from GDP-D-mannose by the sequential action of GDP-D-mannose- 
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4,6-dehydratase (an NAD(P)-dependent enzyme), and a Afunctional GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase (NADPH-dependent), also referred to in scientific 
literature as GDP-fucose synthetase (Rizzi, et al., 1998, Structure 6: 1453-1465; Somers, 
et al., 1998, Structure 6: 1601-1612). This pathway for L-fucose biosynthesis appears to 
5 be ubiquitous (Rizzi, et al., 1998, Structure 6: 1453-1465). The mechanisms for GDP-D- 
mannose-4,6-dehydratase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase are 
shown in Fig. 8A, B (Chang, et al., 1988,7.5/0/. Chem. 263:1693-1697; Barber, 1980, 
Plant Physiol 66:326-329). 

Comparison of Figs. 7 and 8 A, B reveals that Barber's proposed mechanism for 

1 0 GDP-D-mannose : GDP-L-galactose epimerase is analogous to the reaction mechanism for 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. The same mechanism has also 
been demonstrated for the epimerization reaction that occurs in the biosynthesis of two 
TDP-6-deoxy hexoses, TDP-L-rhamnose and TDP-6-deoxy-L-talose, from TDP-D- 
ghicose (Liu and Thorson, 1994, Ann. Rev. Microbiol. 48:223-256). In the latter cases, 

1 5 however, the final reduction at C-4 is catalyzed by NADPH-dependent reductases that are 
separate from the epimerase enzyme. These reductases have opposite stereospecificity, 
providing either TDP-L-rhamnose or TDP-6-deoxy-L-talose (Liu and Thorson, 1994, 
Ann. Rev. Microbiol. 48:223-256). 

In all of the mechanisms described above, NAD(P)H is required for the final 

20 reduction at C-4 (refer to Fig. 8B). In the work of Hebda, et al. (1979, Arch. Biochem. 
Biophys. 194:496-502), it was reported that GDP-D-mannose:GDP-L-galactose 
epimerase from C. pyrenoidosa did not require NAD, NADP or NADH for activity. 
Strangely, NADPH was not tested. Based on the analogous mechanisms shown in Figs. 
7 and 8A, B, the present inventors believe that it is likely that GDP-D-mannose: GDP-L- 

25 galactose epimerase from C. pyrenoidosa requires NADPH for the final reduction step. 
Why activity was detected in vitro without NADPH addition is not known, but tight 
♦binding of NADPH to the enzyme could explain this observation. On the other hand, if 
the proposed mechanism of Feingold and Avigad (1980, in The Biochemistry of Plants, 
Vol. 3, p. 101-170: Carbohydrates; Structure and Function, P.K. Stompf and E E. Conn, 

30 ed., Academic Press, NY) is correct, the reduced enzyme-bound cofactor generated in the 
first oxidation step of the epimerase reaction would serve as the source of electrons for 
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the final reduction of the keto group at C-4 back to the alcohol. Thus no addition of 
exogenous reduced cofactor would be required for activity in vitro. 

Recently, a human gene encoding the bifunctional GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase was cloned and sequenced (Tonetti, et al., 1996, J. Biol. 
5 Chem. 271-27274-27279). This amino acid sequence of the human GDP^-keto-6-deoxy- 
D-mannose epimerase/reductase shows significant homology (29% identity) to the K coli 
GDP-4-keto-6-deoxy-I>rnarinose epimerase/reductase (Tonetti, et al., 1998, Acta Cryst. 
D54:684-686; Somers, et al., 1998, Structure 6:1601-1612, both of which are 
incorporated herein by reference in their entireties). Tonetti et al. and Somers et al. 

1 0 additionally disclosed the tertiary (three dimensional) structure of the K coli GDP-4-keto- 
6-deoxy-D-mannose epimerase/reductase (also known as GDP-fucose synthetase), and 
noted significant structural homology with another epirnerase, UDP-galactose 4-epimerase 
(GalE). These epimerases also share significant homology at the sequence level Since 
no gene encoding a GDP-D-mannose : GDP-1 ^-galactose epirnerase has been cloned and 

1 5 sequenced, homology with genes encoding GDP-4-keto-6-deoxy-D-mannose epirnerase/ 
reductases or with genes encoding a UDP-galactose 4-epimerase has not been 
demonstrated. However, based on the similarity of the reaction products for GDP-D- 
mannose GDP-L-galactose epirnerase and GDP-4-keto-6-deoxy-D-mannose epirnerase/ 
reductase (i.e., GDP-L-galactose and GDP-6-deoxy-L-galactose [i.e., GDP-L-fucose], 

20 respectively) and the common catalytic mechanisms (Figs. 7 and 8A, B) the present 
inventors believe that the genes encoding the enzymes will have a high degree of sequence 
homology, as well as tertiary structural homology. 

Significant structural homology between GDP-D-mannose:GDP-L-galactose 
epirnerase and GDP-4-keto-6^eoxy-I>rnannose epirnerase/ reductases may allow a GDP- 

25 4-keto-6^eoxy-D-mannose epimerase/reductase, or a homologue thereof, to function in 
the L-ascorbic acid biosynthetic pathway, and a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase could potentially be even better than a GDP-D-mannose-GDP-L- 
galactose epirnerase for increasing L-ascorbic acid production in genetically engineered 
plants and/or microorganisms. Furthermore, a nucleotide sequence encoding all or part 

30 of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can be used as a probe to 
identify the gene encoding GDP-D-mannose: GDP-L-galactose epirnerase. Likewise, the 
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nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase can be used to design oligonucleotide primers for use in a PCR-based strategy 
for identifying and cloning a gene encoding GDP-D-mannose: GDP-L-galactose 
epimerase. 

5 The ability to substitute GDP-4-keto-6-D-mannose epimerase/reductase for GDP- 

D-mannose:GDP-L-galactose epimerase to enhance L-ascorbic acid biosynthesis in plants 
or rnicroorganisms depends on the ability of GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase to act directly on GDP-D-mannose to form GDP-L-galactose. Evidence 
supporting this possibility already exists. Arabidopsis thaliana murl mutants are defective 

10 in GDP-D-mannose-4, 6-dehydratase activity (Bonin, et al., 1997, Proc. Natl Acad Set 
94:2085-2090). These mutants are thus blocked in GDP-L-fucose biosynthesis, and 
consequently have less than 2% of the normal amounts of L-fucose in the primary cell 
walls of aerial portions of the plant (Zabiackis, et al., 1996, Science 272:1808-1810). The 
murl mutants are more brittle than wild-type plants, are slightly dwarfed and have an 

15 apparently normal life cycle (Zabiackis, et al., 272: 1808-1810). When murl mutants are 
grown in the presence of exogenous L-fucose, the L-fucose content in the plant is restored 
to the wild-type state (Bonin, et al., 1997, Proc. Natl Acad. Set 94:2085-2090). It was 
discovered (Zabiackis, et al., 1996, Science 272: 1808-18 10) that murl mutants contain, 
in the hemicellulose xyloglucan component of the primary cell wall, L-galactose in place 

20 of the normal L-fucose L-galactose is not normally found in the xyloglucan component, 
but in murl mutants L-galactose partly replaces the terminal L-fucosyl residue. Bonin, et 
al. (1997, Proc, Natl Acad Set 94:2085-2090) hypothesized that in the absence of a 
functional GDP-D-mannose-4,6-dehydratase in the murl mutants, the GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase normally involved in L-fucose synthesis may be 

25 able to use GDP-D-mannose directly, forming GDP-L-galactose. Another possibility, 
however, is that the enzymes involved in L-ascorbic acid biosynthesis in A. thaliana are 
responsible for forming GDP-L-galactose in the murl mutant. If this were true, it would 
suggest that in the wild-type plant, some mechanism exists that prevents GDP-L-galactose 
formed in the L-ascorbic acid pathway from entering cell wall biosynthesis and 

30 substituting for (competing with) GDP-L-fucose for incorporation into the xyloglucan 
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component (since L-galactose is not present in the primary cell wall of the wild-type 
plant). 

Because of the similar reaction mechanisms of GDP-D-mannose:GDP-L-galactose 
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, and because of the 
5 evidence that GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can act directly on 
GDP-D-mannose to form GDP -L-galactose, the present inventors believe that genes 
encoding all epimerases and epimerase/reductases that act on GDP-D-mannose have high 
homology. As such, one aspect of the present invention relates to the use of any 
epimerase (and nucleic acid sequences encoding such epimerase) having significant 

1 0 homology (at the primary, secondary and/or tertiary structure level) to a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase or to a UDP-galactose 4-epimerase for the 
purpose of improving the biosynthetic production of L-ascorbic acid. 

Therefore, as described above, one embodiment of the present invention relates 
to a method for producing ascorbic acid or esters thereof in a microorganism, which 

1 5 includes culturing a microorganism having a genetic modification to increase the action 
of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Also 
included in the present invention are genetically modified microorganisms and plants in 
which the genetic modification increases the action of an epimerase that catalyzes 
conversion of GDP-D-mannose to GDP-L-galactose. 

20 According to the present invention, an increase in the action of the GDP-D- 

mannose:GDP-L-galactose epimerase in the L-ascorbic acid production pathway can be 
achieved by genetic modifications which include, but are not limited to overexpression of 
the GDP-I>rriaruK>se: GDP-L-galactose epimerase gene, a homologue of such gene, or of 
any recombinant nucleic acid sequence encoding an epimerase that is homologous in 

25 primary (nucleic acid or amino acid sequence) or tertiary (three dimensional protein) 
structure to a GDP^-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 
4-epimerase, such as by overexpression of a recombinant nucleic acid molecule encoding 
the epimerase gene or a homologue thereof, and/or by mutation of the endogenous or 
recombinant gene to enhance expression of the gene. 

30 According to the present invention, an epimerase that has a tertiary structure that 

is homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 



WO 99/64618 PCT/US99/1 1576 

28 

reductase is an epimerase that has a tertiary structure that substantially conforms to the 
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented 
by the atomic coordinates having Brookhaven Protein Data Bank Accession Code lbws 
(Table 12). In another embodiment, an epimerase that has a tertiary structure that is 
5 homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase is an epimerase that has a tertiary structure that substantially conforms to the 
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented 
by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1GFS. 
As used herein, a "tertiary structure" or "three dimensional structure" of a protein, such 

1 0 terms being interchangeable, refers to the components and the manner of arrangement of 
the components in three dimensional space to constitute the protein. The use of the term 
"substantially conforms" refers to at least a portion of a tertiary structure of an epimerase 
which is sufficiently spatially similar to at least a portion of a specified three dimensional 
configuration of a particular set of atomic coordinates (e.g., those represented by 

1 5 Brookhaven Protein Data Bank Accession Code lbws) to allow the tertiary structure of 
at least said portion of the epimerase to be modeled or calculated (i.e., by molecular 
replacement) using the particular set of atomic coordinates as a basis for estimating the 
atomic coordinates defining the three dimensional configuration of the epimerase. 

More particularly, a tertiary structure that substantially conforms to a given set of 

20 atomic coordinates is a structure having an average root-mean-square deviation (RMSD) 
of less than about 2.5 A, and more preferably, less than about 2 A, and, in increasing 
preference, less than about 1.5 A, less than about 1 A, less than about 0.5 A, and most 
preferably, less than about 0.3 A, over at least about 25% of the Ccc positions as 
compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 

25 reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code lbws. In other embodiments, a structure that substantially conforms to 
a given set of atomic coordinates is a structure wherein such structure has the recited 
average root-mean-square deviation (RMSD) value over at least about 50% of the Cot 
positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 

30 epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code lbws, and in another embodiment, such structure has the 
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recited average root-mean-square deviation (RMSD) value over at least about 75% of the 
Ca positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code lbws, and in another embodiment, such structure has the 
5 recited average root-mean-square deviation (RMSD) value over about 100% of the Co 
positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code Ibws. Methods to calculate RMSD values are well known 
in the art. Various software programs for determining the tertiary structural homology 

10 between one or more proteins are known in the art and are publicly available, such as 
QUANTA (Molecular Simulations Inc.). 

A preferred epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose according to the method and genetically modified organisms of the present 
invention includes an epimerase that comprises a substrate binding she having a tertiary 

1 5 structure that substantially conforms to the tertiary structure of the substrate binding site 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic 
coordinates having Brookhaven Protein Data Bank Accession Code lbws. Preferably, the 
tertiary structure of the substrate binding site of the epimerase has an average root-mean- 
square deviation (RMSD) of less than about 2.5 A, and more preferably, less than about 

20 2 A, and, in increasing preference, less than about 1 .5 A, less than about 1 A, less than 
about 0.5 A, and most preferably, less than about 0.3 A, over at least about 25% of the 
Ca positions as compared to the tertiary structure of the substrate binding site of a GDP- 
4-keto-6-deoxy-D-rnannose epimerase/reductase represented by the atomic coordinates 
having Brookhaven Protein Data Bank Accession Code lbws. In other embodiments, the 

25 tertiary structure of the substrate binding site of the epimerase has the recited average 
root-mean-square deviation (RMSD) value over at least about 50% of the Ca positions 
as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code lbws, and in another embodiment, the 

30 tertiary structure of the substrate binding site of the epimerase has the recited average 
root-mean-square deviation (RMSD) value over at least about 75% of the Ca positions 
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as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code lbws, and in another embodiment, the 
tertiary structure of the substrate binding site of the epimerase has the recited average 
5 root-mean-square deviation (RMSD) value over about 100% of the Ccc positions as 
compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code lbws. The tertiary structure of the 
substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 

1 0 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code lbws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary 
structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code 1GFS is discussed in detail in Somers et al., 1998, ibid. 

15 Another preferred epimerase according to the present invention includes an 

epimerase that comprises a catalytic site having a tertiary structure that substantially 
conforms to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase represented by the atomic coordinates having Brookhaven 
Protein Data Bank Accession Code lbws. Preferably, the tertiary structure of the 

20 catalytic site of the epimerase has an average root-mean-square deviation (RMSD) of less 
than about 2.5 A, and more preferably, less than about 2 A, and, in increasing preference, 
less than about 1.5 A, less than about 1 A, less than about 0.5 A, and most preferably, less 
than about 0.3 A, over at least about 25% of the Cot positions as compared to the tertiary 
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 

25 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code lbws. In other embodiments, the tertiary structure of the catalytic site of the 
epimerase has the recited average root-mean-square deviation (RMSD) value over at least 
about 50% of the Ca positions as compared to the tertiary structure of the catalytic site 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic 

30 coordinates having Brookhaven Protein Data Bank Accession Code lbws, and in another 
embodiment, the tertiary structure of the catalytic site of the epimerase has the recited 
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average root-mean-square deviation (RMSD) value over at least about 75% of the Ca 
positions as compared to the tertiary structure of the catalytic site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code lbws, and in another embodiment, the 
5 tertiary structure of the catalytic site of the epimerase has the recited average root-mean- 
square deviation (RMSD) value over 100% of the Ca positions as compared to the 
tertiary structure of the catalytic site of a GDP-4-keto-6~deoxy-D-mannose epimerase/ 
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code lbws. 

10 In one embodiment, an epimerase encompassed by the present invention includes 

an epimerase that has a catalytic site which includes amino acid residues: serine, tyrosine 
and lysine. In a preferred embodiment, the tertiary structure positions of the amino acid 
residues serine, tyrosine and lysine substantially conform to the tertiary structure position 
of residues Serl07, Tyrl36 and Lysl40, respectively, as represented by atomic 

15 coordinates in Brookhaven Protein Data Bank Accession Code lbws. The tertiary 
structure of the catalytic site of a GDP-4-ke£o-6-deoxy-D-mannose epimerase/reductase 
represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code lbws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary 
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 

20 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code 1GFS is discussed in detail in Somers et al., 1998, ibid. 

In an even more preferred embodiment, the above definition of "substantially 
conforms" can be further defined to include atoms of amino acid side chains. As used 
herein, the phrase "common amino acid side chains'* refers to amino acid side chains that 

25 are common to both the structures which substantially conforms to a given set of atomic 
coordinates and the structure that is actually represented by such atomic coordinates. 
Preferably, a tertiary structure that substantially conforms to a given set of atomic 
coordinates is a structure having an average root-mean-square deviation (RMSD) of less 
than about 2.5 A, and more preferably, less than about 2 A, and, in increasing preference, 

30 less than about 1 .5 A, less than about 1 A, less than about 0.5 A, and most preferably, less 
than about 0.3 A over at least about 25% of the common amino acid side chains as 
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compared to the tertiary structure represented by the given set of atomic coordinates. In 
another embodiment, a structure that substantially conforms to a given set of atomic 
coordinates is a structure having the recited average root-mean-square deviation (RMSD) 
value over at least about 50% of the common amino acid side chains as compared to the 
5 tertiary structure represented by the given set of atomic coordinates, and in another 
embodiment, such structure has the recited average root-mean-square deviation (RMSD) 
value over at least about 75% of the common amino acid side chains as compared to the 
tertiary structure represented by the given set of atomic coordinates, and in another 
embodiment, such a structure has the recited average root-mean-square deviation 

10 (RMSD) value over 100% of the common amino acid side chains as compared to the 
tertiary structure represented by the given set of atomic coordinates. 

A tertiary structure of an epimerase which substantially conforms to a specified set 
of atomic coordinates can be modeled by a suitable modeling computer program such as 
MODELER (A. Sali and TX. Blundell, J. Mol Biol, vol. 234:779-815, 1993 as 

1 5 implemented in the Insight II Homology software package (Insight II (97.0), MSI, San 
Diego)), using information, for example, derived from the following data: (1) the amino 
acid sequence of the epimerase; (2) the amino acid sequence of the related portion(s) of 
the protein represented by the specified set of atomic coordinates having a three 
dimensional configuration; and, (3) the atomic coordinates of the specified three 

20 dimensional configuration. Alternatively, a tertiary structure of an epimerase which 
substantially conforms to a specified set of atomic coordinates can be modeled using data 
generated from analysis of a crystallized structure of the epimerase. A tertiary structure 
of an epimerase which substantially conforms to a specified set of atomic coordinates can 
also be calculated by a method such as molecular replacement. Methods of molecular 

25 replacement are generally known by those of skill in the art (generally described in 
Bnmger, Metk Enzym , vol. 276, pp. 558-580, 1997; Navaza and Saludjian, Metk 
Enzym., vol. 276, pp. 581-594, 1997; Tong and Rossmann, Metk Enzym., vol 276, pp. 
594-61 1, 1997; and Bentley, Metk Enzym., vol. 276, pp. 611-619, 1997, each of which 
are incorporated by this reference herein in their entirety) and are performed in a software 

30 program including, for example, XPLOR (Brunger, et al., Science, vol. 235, p. 458, 
1987) In addition, a structure can be modeled using techniques generally described by, 
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for example, Sali, Current Opinions in Biotechnology, vol. 6, pp. 437-451, 1995, and 
algorithms can be implemented in program packages such as Homology 95.0 (in the 
program Insight II, available from Biosym/MSI, San Diego, CA). Use of Homology 95.0 
requires an alignment of an amino acid sequence of a known structure having a known 
5 three dimensional structure with an amino acid sequence of a target structure to be 
modeled. The alignment can be a pairwise alignment or a multiple sequence alignment 
including other related sequences (for example, using the method generally described by 
Rost, Meth. Enzymol., vol. 266, pp. 525-539, 1996) to improve accuracy. Structurally 
conserved regions can be identified by comparing related structural features, or by 

10 examining the degree of sequence homology between the known structure and the target 
structure. Certain coordinates for the target structure are assigned using known structures 
from the known structure. Coordinates for other regions of the target structure can be 
generated from fragments obtained from known structures such as those found in the 
Protein Data Bank maintained by Brookhaven National Laboratory, Upton, NY. 

1 5 Conformation of side chains of the target structure can be assigned with reference to what 
is sterically allowable and using a library of rotamers and their frequency of occurrence 
(as generally described in Ponder and Richards, J. MoL Biol. 7 vol. 193, pp. 775-791, 
1987). The resulting model of the target structure, can be refined by molecular mechanics 
(such as embodied in the program Discover, available from Biosym/MSI) to ensure that 

20 the model is chemically and conformationally reasonable. 

According to the present invention, an epimerase that has a nucleic acid sequence 
that is homologous at the primary structure level (i.e., is a homologue of) to a nucleic acid 
sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP- 
galactose 4-epimerase includes any epimerase encoded by a nucleic acid sequence that is 

25 at least about 15%, and preferably at least about 20%, and more preferably at least about 
25%, and even more preferably, at least about 30% identical to a nucleic acid sequence 
encoding a GDP^-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4- 
epimerase, and preferably to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:l, SEQ ID NO:3, SEQ ID N0 5, SEQ ID NO:7 or SEQ ID NO:9. 

30 Similarly, an epimerase that has an amino acid sequence that is homologous to an amino 
acid sequence of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP- 
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galactose 4-epimerase includes any epimerase having an amino acid sequence that is at 
least about 15%, and preferably at least about 20%, and more preferably at least about 
25%, and even more preferably, at least about 30% identical to an amino acid sequence 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4- 
5 epimerase, and preferably to an amino acid sequence selected from the group consisting 
of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ ID NO: 10. 

According to one embodiment of the present invention, homology or percent 
identity between two or more nucleic acid or amino acid sequences is performed using 
methods known in the art for aligning and/or calculating percentage identity. To compare 

10 the homology/ percent identity between two or more sequences as set forth above, for 
example, a module contained within DNASTAR (DNASTAR, Inc., Madison, Wisconsin) 
can be used. In particular, to calculate the percent identity between two nucleic acid or 
amino acid sequences, the Lipman-Pearson method, provided by the MegAlign module 
within the DNASTAR program, is preferably used, with the following parameters, also 

15 referred to herein as the Lipman-Pearson standard default parameters: 

(1) Ktuple = 2; 

(2) Gap penalty = 4; 

(3) Gap length penalty = 12. 

Using the Lipman-Pearson method with these parameters, for example, the percent 
20 identity between the amino acid sequence for K colt GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (SEQ ID NO:4) and human GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (FX) (SEQ ID NO:6) is 27.7%, which is comparable to the 27% 
identity described for these enzymes in Tonetti et al., 1998, Acta Cryst. D54:684-686. 
According to another embodiment of the present invention, to align two or more 
25 nucleic acid or amino acid sequences, for example to generate a consensus sequence or 
evaluate the similarity at various positions between such sequences, a CLUSTAL 
alignment program (e.g., CLUSTAL, CLUSTAL V, CLUSTAL W), also available as a 
module within the DNASTAR program, can be used using the following parameters, also 
referred to herein as the CLUSTAL standard default parameters: 
30 Multiple Alignment Parameters (\ e . for more than 2 sequences): 

(1) Gap penalty = 10; 
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(2) Gap length penalty = 10; 

Pairwise Alignment Parameters (i.e .. for two sequences) : 

(1) Ktuple-l; 

(2) Gap penalty = 3; 
5 (3) Window = 5; 

(4) Diagonals saved = 5. 

According to the present invention, a GDP-4-keto-6-deoxy-E)-mannose epimerase/ 
reductase can be a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from any 
organism, including Arabidopsis thaliana, Escherichia coli, and human. A nucleic acid 

10 sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from 
Arabidopsis thaliana is represented herein by SEQ TD NO: 1 . SEQ ID NO: 1 encodes a 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence 
represented herein as SEQ ID NO:2. A nucleic acid sequence encoding a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase from Escherichia coli is represented herein by 

15 SEQ ID NO:3. SEQ ID NO: 3 encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase having an amino acid sequence represented herein as SEQ ID NO 4 A nucleic 
acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from 
homo sapiens is represented herein by SEQ ID NO:5. SEQ ID NO:5 encodes a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence represented 

20 herein as SEQ ID NO:6. 

According to the present invention, a UDP-galactose 4-epimerase can be a UDP- 
galactose 4-epimerase from any organism, including Escherichia coli and human. A 
nucleic acid sequence encoding a UDP-galactose 4-epimerase from Escherichia coli is 
represented herein by SEQ ID NO:7. SEQ ID NO:7 encodes a UDP-galactose 4- 

25 epimerase having an amino acid sequence represented herein as SEQ ID NO: 8 A nucleic 
acid sequence encoding a UDP-galactose 4-epimerase from homo sapiens is represented 
herein by SEQ ID NO:9 SEQ ID N0 9 encodes a UDP-galactose 4-epimerase having 
an amino acid sequence represented herein as SEQ ID NO: 10. 

In a preferred embodiment, an epimerase encompassed by the present invention 

30 has an amino acid sequence that aligns with the amino acid sequence of SEQ ID NO: 1 1 , 
for example using a CLUSTAL alignment program, wherein amino acid residues in the 
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amino acid sequence of the epimerase align with 100% identity with at least about 50% 
of non-Xaa residues in SEQ ID NO: 11, and preferably at least about 75% of non-Xaa 
residues in SEQ ID NO: 1 1, and more preferably, at least about 90% of non-Xaa residues 
in SEQ ID NO: 11, and even more preferably 100% of non-Xaa residues in SEQ ID 
5 NO: 11. The percent identity of residues aligning with 100% identity with non-Xaa 
residues can be simply calculated by dividing the number of 100% identical matches at 
non-Xaa residues in SEQ ID NO: 1 1 by the total number of non-Xaa residues in SEQ ID 
NO: 11. A preferred nucleic acid sequence encoding an epimerase encompassed by the 
present invention include a nucleic acid sequence encoding an epimerase having an amino 

10 acid sequence with the above described identity to SEQ ID NO: 1 1 . Such an alignment 
using a CLUSTAL alignment program is based on the same parameters as previously 
disclosed herein. SEQ ID NO: 1 1 represents a consensus amino acid sequence of an 
epimerase which was derived by aligning at least portions of amino acid sequences SEQ 
ID NO:4, SEQ ID NO:6 and SEQ ID NO:8, as described in Somers et al., 1 998, Structure 

15 6:1601-161 2, and can be approximately duplicated using CLUSTAL. 

In another embodiment, an epimerase encompassed by the present invention 
includes an epimerase that has a catalytic site which includes amino acid residues: serine, 
tyrosine and lysine. Preferably, such serine, tyrosine and lysine residues are located at 
positions in the epimerase amino acid sequence which align using a CLUSTAL alignment 

20 program with positions Serl05, Tyrl34 and Lysl38 of consensus sequence SEQ ID 
NO: 11, with positions Serl09, Tyrl38 and Lysl42 of sequence SEQ ID NO:2, with 
positions Serl07, Tyrl36 and Lysl40 of SEQ ID NO:4, with positions Serl 14, Tyrl43 
and Lysl47 of sequence SEQ ID NO:6, with positions Serl24, Tyrl49 and Lysl53 of 
sequence SEQ ID NO:8 or with positions Serl 32, Tyrl 57 and Lysl61 of sequence SEQ 

25 ID NO: 10. 

In another embodiment, an epimerase that has an amino acid sequence that is 
homologous to an amino acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase includes any epimerase that has an amino acid motif: Gly-Xaa-Xaa- 
GHy-Xaa-Xaa-Gry, which is found, for example in positions 8 through 14 of the consensus 
30 sequence SEQ ID NO: 1 1, in positions 12 through 18 of SEQ ID NO:2, in positions 10 
through 16 of SEQ ID NO:4, in positions 14 through 20 of SEQ ID NO: 6, in positions 
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7 through 13 of SEQ ID NO:8, and in positions 9 through 15 of SEQ ID NO: 10. Such 
a motif can be identified by its alignment with the same motif in the above-identified amino 
acid sequences using a CLUSTAL alignment program. Preferably, such motif is located 
within the first 25 N-terminal amino acids of the amino acid sequence of the epimerase 
5 In yet another embodiment, an epimerase encompassed by the present invention 

includes an epimerase that has a substrate binding site which includes amino acid residues 
that align using a CLUSTAL alignment program with at least 50% of amino acid 
positions Asnl77, Serl78, Argl87, Arg209, Lys283, Asnl65, Serl07, Serl08, Cysl09, 
Asnl33, Tyrl36 and Hisl79 of SEQ ID NO:4. Alignment with positions Serl07, Tyrl36, 

10 Asnl65, Arg209, is preferably with 100% identity (i.e., exact match of residue, under 
parameters for alignment). 

In another embodiment of the present invention, an epimerase encompassed by the 
present invention comprises at least 4 contiguous amino acid residues having 100% 
identity with at least 4 contiguous amino acid residues of an amino acid sequence selected 

15 from the group of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ 
ID NO: 10, as determined using a Lipman-Pearson method with Lipman-Pearson standard 
default parameters or by comparing an alignment using a CLUSTAL program with 
CLUSTAL standard default parameters. According to the present invention, the term 
"contiguous" means to be connected in an unbroken sequence. For a first sequence to 

20 have "100% identity" with a second sequence means that the first sequence exactly 
matches the second sequence with no gaps between nucleotides or amino acids. 

In another embodiment of the present invention, an epimerase encompassed by the 
present invention is encoded by a nucleic acid sequence that comprises at least 12 
contiguous nucleic acid residues having 100% identity with at least 12 contiguous nucleic 

25 acid residues of a nucleic acid sequence selected from the group of SEQ ID NO: 1, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO: 10, as determined using a 
Lipman-Pearson method with Lipman-Pearson standard default parameters or by 
comparing an alignment using a CLUSTAL program with CLUSTAL standard default 
parameters. 

30 In another ernbodiment of the present invention, an epimerase encompassed by the 

present invention is encoded by a nucleic acid sequence that hybridizes under stringent 
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hybridization conditions to a nucleic acid sequence selected from the group of SEQ ID 
NO l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9. As used herein, 
stringent hybridization conditions refer to standard hybridization conditions under which 
nucleic acid molecules are used to identify similar nucleic acid molecules. Such standard 
5 conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Labs Press, 1989. Sambrook et al., ibid, is 
incorporated by reference herein in its entirety (see specifically, pages 9.31-9.62). In 
addition, formulae to calculate the appropriate hybridization and wash conditions to 
achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, 

10 for example, in Meinkoth et al., 1984, Anal. Biochem. 138, 267-284; Meinkoth et al., 
ibid., is incorporated by reference herein in its entirety. 

More particularly, stringent hybridization and washing conditions, as referred to 
herein, refer to conditions which permit isolation of nucleic acid molecules having at least 
about 70% nucleic acid sequence identity with the nucleic acid molecule being used to 

15 probe in the hybridization reaction, more particularly at least about 75%, and most 
particularly at least about 80%. Such conditions will vary, depending on whether 
DNA RNA or DNA DNA hybrids are being formed. Calculated melting temperatures for 
DNA:DNA hybrids are 10°C less than for DNAiRNA hybrids. In particular 
embodiments, stringent hybridization conditions for DNA:DNA hybrids include 

20 hybridization at an ionic strength of 6X SSC (0.9 M Na*) at a temperature of between 
about 20°C and about 35°C, more preferably, between about 28°C and about 40°C, and 
even more preferably, between about 35°C and about 45 °C. In particular embodiments, 
stringent hybridization conditions for DNA RN A hybrids include hybridization at an ionic 
strength of 6X SSC (0.9 M Na + ) at a temperature of between about 30 °C and about 

25 45 °C, more preferably, between about 38°C and about 50°C, and even more preferably, 
between about 45 °C and about 55 °C. These values are based on calculations of a melting 
temperature for molecules larger than about 100 nucleotides, 0% formamide and a G + 
C content of about 40%. Alternatively, T m can be calculated empirically as set forth in 
Sambrook et al., supra, pages 9.3 1 to 9.62. 

30 In another embodiment of the present invention, an epimerase encompassed by the 

present invention is encoded by a nucleic acid sequence that comprises a nucleic acid 
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sequence selected from the group of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID NO:7 or SEQ ID NO: 9 or a fragment thereof wherein the fragment encodes a protein 
that is capable of catalyzing the conversion of GDP-D-maimose to GDP-L-galactose, such 
as under physiological conditions. In another embodiment, an epimerase encompassed by 
5 the present invention comprises an amino acid sequence selected from the group of SEQ 
ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10 or a fragment 
thereof wherein the fragment is capable of catalyzing the conversion of GDP-D-mannose 
to GDP-L-galactose. It is to be understood that the nucleic acid sequence encoding the 
amino acid sequences identified herein can vary due to degeneracies. As used herein, 

1 0 nucleotide degeneracies refers to the phenomenon that one amino acid can be encoded by 
different nucleotide codons. 

One embodiment of the present invention relates to a method to identify an 
epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Preferably, 
such a method is useful for identifying the GDP-D-mannose: GDP-L-galactose epimerase 

15 which catalyzes the conversion of GDP-D-mannose to GDP-L-galactose in the 
endogenous (i.e., naturally CKxurring L-ascorbic acid biosynthetic pathway of 
microorganisms and/or plants). Such a method can include the steps of: (a) contacting a 
source of nucleic acid molecules with an oligonucleotide at least about 12 nucleotides in 
length under stringent hybridization conditions, wherein the oligonucleotide is identified 

20 by its ability to hybridize under stringent hybridization conditions to a nucleic acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3 and SEQ 
ID NO:5; and, (b) identifying nucleic acid molecules from the source of nucleic acid 
molecules which hybridize under stringent hybridization conditions to the oligonucleotide. 
Nucleic acid molecules identified by this method can then be isolated from the source 

25 using standard molecular biology techniques. Preferably, the source of nucleic acid 
molecules is obtained from a microorganism or plant that has an ascorbic acid production 
pathway. Such a source of nucleic acid molecules can be any source of nucleic acid 
molecules which can be isolated from an organism and/or which can be screened by 
hybridization with an oligonucleotide such as a probe or a PCR primer. Such sources 

30 include genomic and cDNA libraries and isolated RNA. 
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In order to screen cDNA libraries from different organisms and to isolate nucleic 
acid molecules encoding enzymes such as the GDP-D-mannose: GDP-L-galactose 
epimerase and related epirnerases, one can use any of a variety of standard molecular and 
biochemical techniques. For example, oligonucleotide primers, preferably degenerate 
5 primers, can be designed using the most conserved regions of a GDP-4-keto-6-deoxy-D- 
rnannose epimerase/reductase nucleic acid sequence, and such primers can be used in a 
polymerase chain reaction (PCR) protocol to amplify the same or related epirnerases, 
including the GDP-D-mannose: GDP-L-galactose epimerase from the ascorbic acid 
pathway, from nucleic acids (e.g., genomic or cDNA libraries) isolated from a desired 

10 organism (e.g., a microorganism or plant having an L-ascorbic acid pathway). Similarly, 
oligonucleotide probes can be designed using the most conserved regions of a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase nucleic acid sequence and such probe can 
be used to identify and isolate nucleic acid molecules, such as from a genomic or cDNA 
library, that hybridize under conditions of low, moderate, or high stringency with the 

15 probe. 

Alternatively, the GDP-D-rnannose: GDP-L-galactose epimerase can be purified 
from an organism such as Prototheca, the N-terminal amino acid sequence can be 
determined (including the sequence of internal peptide fragments), and this information 
can be used to design degenerate primers for amplifying a gene fragment from the 

20 organism cDNA. This fragment would then be used to probe the cDNA library, and 
subsequently fragments that hybridize to the probe would be cloned in that organism or 
another suitable production organism. There is ample precedent for plant enzymes being 
expressed in an active form in bacteria, such as E. coli. Alternatively, yeast are also a 
suitable candidate for developing a heterologous system for L-ascorbic acid production. 

25 As discussed above in general for increasing the action of an enzyme in the L- 

ascorbic acid pathway according to the present invention, in one embodiment of the 
present invention, the action of an epimerase that catalyzes the conversion of GDP-D- 
mannose to GDP-L-galactose is increased by amplification of the expression (i.e., 
overexpression) of such an epimerase. Overexpression of an epimerase can be 

30 accomplished, for example, by introduction of a recombinant nucleic acid molecule 
encoding the epimerase. It is preferred that the gene encoding an epimerase according to 
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the present invention be cloned under control of an artificial promoter. The promoter can 
be any suitable promoter that will provide a level of epimerase expression required to 
maintain a sufficient level of L-ascorbic acid in the production organism. Preferred 
promoters are constitutive (rather than inducible) promoters, since the need for addition 
5 of expensive inducers is therefore obviated. The gene dosage (copy number) of a 
recombinant nucleic acid molecule according to the present invention can be varied 
according to the requirements for maximum product formation. In one embodiment, the 
recombinant nucleic acid molecule encoding an epimerase according to the present 
invention is integrated into the chromosome of the microorganism. 

10 It is another embodiment of the present invention to provide a microorganism 

having one or more epimerases according to the present invention with improved affinity 
for its substrate. An epimerase with improved affinity for its substrate can be produced 
by any suitable method of genetic modification or protein engineering. For example, 
computer-based protein engineering can be used to design an epimerase protein with 

15 greater stability and better affinity for its substrate. See for example, Maulik et al., 1997, 
Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc., 
which is incorporated herein by reference in its entirety. 

As noted above, in the method for production of L-ascorbic acid of the present 
invention, a microorganism having a genetically modified L-ascorbic acid production 

20 pathway is cultured in a fermentation medium for production of L-ascorbic acid. An 
appropriate, or effective, fermentation medium refers to any medium in which a genetically 
modified microorganism of the present invention, when cultured, is capable of producing 
L-ascorbic acid. Such a medium is typically an aqueous medium comprising assimilable 
carbon, nitrogen and phosphate sources. Such a medium can also include appropriate 

25 salts, minerals, metals and other nutrients. One advantage of genetically modifying a 
microorganism as described herein is that although such genetic modifications can 
significantly alter the production of L-ascorbic acid, they can be designed such that they 
do not create any nutritional requirements for the production organism. Thus, a minimal- 
salts medium containing glucose as the sole carbon source can be used as the fermentation 

30 medium. The use of a minimal-saks-ghicose medium for the L-ascorbic acid fermentation 
will also facilitate recovery and purification of the L-ascorbic acid product. 
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In one mode of operation of the present invention, the carbon source 
concentration, such as the glucose concentration, of the fermentation medium is 
monitored during fermentation. Glucose concentration of the fermentation medium can 
be monitored using known techniques, such as, for example, use of the glucose oxidase 
5 enzyme test or high pressure liquid chromatography, which can be used to monitor 
glucose concentration in the supernatant, e.g., a cell-free component of the fermentation 
medium. As stated previously, the carbon source concentration should be kept below the 
level at which cell growth inhibition occurs. Although such concentration may vary from 
organism to organism, for glucose as a carbon source, cell growth inhibition occurs at 

10 glucose concentrations greater than at about 60 g/L, and can be determined readily by 
trial. Accordingly, when glucose is used as a carbon source the glucose concentration in 
the fermentation medium is maintained in the range of from about 1 g/L to about 100 g/L, 
more preferably in the range of from about 2 g/L to about 50 g/L, and yet more preferably 
in the range of from about 5 g/L to about 20 g/L. Although the carbon source 

IS concentration can be maintained within desired levels by addition of, for example, a 
substantially pure glucose solution, it is preferred to maintain the carbon source 
concentration of the fermentation medium by addition of aliquots of the original 
fermentation medium. The use of aliquots of the original fermentation medium are 
desirable because the concentrations of other nutrients in the medium (e.g. the nitrogen 

20 and phosphate sources) can be maintained simultaneously. Likewise, the trace metals 
concentrations can be maintained in the fermentation medium by addition of aliquots of 
the trace metals solution. 

In an embodiment of the fermentation process of the present invention, a 
fermentation medium is prepared as described above. This fermentation medium is 

25 inoculated with 

an actively growing culture of genetically modified microorganisms of the present 
invention in an amount sufficient to produce, after a reasonable growth period, a high cell 
density. Typical inoculation cell densities are within the range of from about 0. 1 g/L to 
about 15 g/L, preferably from about 0.5 g/L to about 10 g/L and more preferably from 

30 about 1 g/L to about 5 g/L, based on the dry weight of the cells. The cells are then grown 
to a cell density in the range of from about 10 g/L to about 100 g/L preferably from about 
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20 g/L to about 80 g/L, and more preferably from about 50 g/L to about 70 g/L. The 
residence times for the microorganisms to reach the desired cell densities during 
fermentation are typically less than about 200 hours, preferably less than about 120 hours, 
and more preferably less than about 96 hours. 
5 The microorganisms useful in the method of the present invention can be cultured 

in conventional fermentation modes, which include, but are not limited to, batch, fed- 
batch, and continuous. It is preferred, however, that the fermentation be carried out in 
fed-batch mode. In such a case, during fermentation some of the components of the 
medium are depleted. It is possible to initiate fermentation with relatively high 

1 0 concentrations of such components so that growth is supported for a period of time before 
additions are required. The preferred ranges of these components are maintained 
throughout the fermentation by making additions as levels are depleted by fermentation. 
Levels of components in the fermentation medium can be monitored by, for example, 
sampling the fermentation medium periodically and assaying for concentrations. 

1 5 Alternatively, once a standard fermentation procedure is developed, additions can be made 
at timed intervals corresponding to known levels at particular times throughout the 
fermentation. As will be recognized by those in the art, the rate of consumption of 
nutrient increases during fermentation as the cell density of the medium increases. 
Moreover, to avoid introduction of foreign microorganisms into the fermentation medium, 

20 addition is performed using aseptic addition methods, as are known in the art. In addition, 
a small amount of anti-foaming agent may be added during the fermentation. 

The present inventors have determined that high levels of magnesium in the 
fermentation medium inhibits the production of L- ascorbic acid due to repression of 
enzymes early in the production pathway, although enzymes late in the pathway (i.e., from 

25 L-galactose to L-ascorbic acid) are not negatively affected (See Examples). Therefore, 
in a preferred embodiment of the method of the present invention, the step of culturing 
is carried out in a fermentation medium that is magnesium (Mg 2 *) limited. Even more 
preferably, the fermentation is magnesium limited during the cell growth phase. 
Preferably, the fomentation medium comprises less than about 0.5 g/L of Mg 2+ during the 

30 cell growth phase of fermentation, and even more preferably, less than about 0.2 g/L of 
Mg 2 *, and even more preferably, less than about 0. 1 g/L of Mg* + . 
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The temperature of the fermentation medium can be any temperature suitable for 
growth and ascorbic acid production, and may be modified according to the growth 
requirements of the production microorganism used. For example, prior to inoculation 
of the fermentation medium with an inoculum, the fermentation medium can be brought 
5 to and maintained at a temperature in the range of from about 20 °C to about 45 °C, 
preferably to a temperature in the range of from about 25 °C to about 40° C, and more 
preferably in the range of from about 30°C to about 38°C. 

It is a further embodiment of the present invention to supplement and/or control 
other components and parameters of the fermentation medium, as necessary to maintain 

10 and/or enhance the production of L-ascorbic acid by a production organism. For example, 
in one embodiment, the pH of the fermentation medium is monitored for fluctuations in 
pH. In the fermentation method of the present invention, the pH is preferably maintained 
at a pH of from about pH 6.0 to about pH 8.0, and more preferably, at about pH 7.0. In 
the method of the present invention, if the starting pH of the fermentation medium is pH 

15 7.0, the pH of the fermentation medium is monitored for significant variations from pH 
7.0, and is adjusted accordingly, for example, by the addition of sodium hydroxide. In a 
preferred embodiment of the present invention, genetically modified microorganisms 
useful for production of L-ascorbic acid include acid-tolerant microorganisms. Such 
microorganisms include, for example, microalgae of the genera Prototheca and Chlorella 

20 (See U.S. Patent No. 5,792,631, ibid and U.S. Patent No. 5,900,370, ibid). 

The production of ascorbic acid by culturing acid-tolerant microorganisms 
provides significant advantages over known ascorbic acid production methods. One such 
advantage is that such organisms are acidophilic, allowing fermentation to be carried out 
under low pH conditions, with the fermentation medium pH typically less than about 6. 

25 Below this pH, extracellular ascorbic acid produced by the microorganism during 
fermentation is relatively stable because the rate of oxidation of ascorbic acid in the 
fermentation medium by oxygen is reduced. Accordingly, high productivity levels can be 
obtained for producing L-ascorbic acid with acid-tolerant microorganisms according to 
the methods of the present invention. In addition, control of the dissolved oxygen content 

30 to very low levels to avoid oxidation of ascorbic acid is unnecessary. Moreover, this 
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advantage allows for the use of continuous recovery methods because extracellular 
medium can be treated to recover the ascorbic acid product. 

Thus, the present method can be conducted at low pH when acid-tolerant 
microorganisms are used as production organisms. The benefit of this process is that at 
5 low pH, extracellular ascorbic acid produced by the organism is degraded at a reduced 
rate than if the fermentation medium was at higher pH. For example, prior to inoculation 
of the fermentation medium with an inoculum, the pH of the fermentation medium can be 
adjusted, and further monitored during fermentation. Typically, the pH of the 
fermentation medium is brought to and maintained below about 6, preferably below 5 .5, 

10 and more preferably below about 5. The pH of the fermentation medium can be 
controlled by the addition of ammonia to the fermentation medium. In such cases when 
ammonia is used to control pH, it also conveniently serves as a nitrogen source in the 
fermentation medium. 

The fermentation medium can also be maintained to have a dissolved oxygen 

15 content during the course of fermentation to maintain cell growth and to maintain cell 
metabolism for L-ascorbic acid formation. The oxygen concentration of the fermentation 
medium can be monitored using known methods, such as through the use of an oxygen 
probe electrode. Oxygen can be added to the fermentation medium using methods known 
in the art, for example, through agitation and aeration of the medium by stirring or 

20 shaking. Preferably, the oxygen concentration in the fermentation medium is in the range 
of from about 20% to about 100% of the saturation value of oxygen in the medium based 
upon the solubility of oxygen in the fermentation medium at atmospheric pressure and at 
a temperature in the range of from about 30°C to about 40°C. Periodic drops in the 
oxygen concentration below this range may occur during fermentation, however, without 

25 adversely affecting the fermentation. 

The genetically modified microorganisms of the present invention are engineered 
to produce significant quantities of extracellular L-ascorbic acid. Extracellular L-ascorbic 
acid can be recovered from the fermentation medium using conventional separation and 
purification techniques. For example, the fermentation medium can be filtered or 

30 centrifuged to remove microorganisms, cell debris and other particulate matter, and L- 
ascorbic acid can be recovered from the cell-free supernate by conventional methods, such 
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as, for example, ion exchange, chromatography, extraction, solvent extraction, membrane 
separation, electrodialysis, reverse osmosis, distillation, chemical derivatization and 
crystallization. 

One such example of L-ascorbic acid recovery is provided in U.S. Patent No. 
5 4,595,659 by Cayle, incorporated herein in its entirety be reference, which discloses the 
isolation of L-ascorbic acid from an aqueous fermentation medium by ion exchange resin 
adsorption and ehition, which is followed by decoloration, evaporation and crystallization. 
Further, isolation of the structurally similar isoascorbic acid from fermentation medium 
by a continuous multi-bed extraction system of ani on-exchange resins is described by K 
1 0 Shimizu, Agr. Biol. Chem. 3 1 :346-353 (1967), which is incorporated herein in its entirety 
by reference. 

Intracellular L-ascorbic acid produced in accordance with the present invention 
can also be recovered and used in a variety of applications. For example, cells from the 
microorganisms can be lysed and the ascorbic acid which is released can be recovered by 

15 a variety of known techniques. Alternatively, intracellular ascorbic acid can be recovered 
by washing the cells to extract the ascorbic acid, such as through diafiltration. 

Development of a microorganism with enhanced ability to produce L-ascorbic acid 
by genetic modification can be accomplished using both classical strain development and 
molecular genetic techniques, and particularly, recombinant technology (genetic 

20 engineering). In general, the strategy for creating a microorganism with enhanced L- 
ascorbic acid production is to (1) inactivate or delete at least one, and preferably more 
than one of the competing or inhibitory pathways in which production of L-ascorbic acid 
is negatively affected (e.g., inhibited), and more significantly to (2) amplify the L-ascorbic 
acid production pathway by increasing the action of a gene(s) encoding an enzyme(s) 

25 involved in the pathway. 

In one embodiment, the strategy for creating a microorganism with enhanced L- 
ascorbic acid production is to amplify the L-ascorbic acid production pathway by 
increasing the action of GDP-D-mannose:GDP-L-galactose epimerase, as discussed 
above. Such strategy includes genetically modifying the endogenous GDP-D- 

30 marmose:GDP-L-galactose epimerase such that L-ascorbic acid production is increased, 
and/or expressing/over expressing a recombinant epimerase that catalyzes the conversion 
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of GDP-D-mannose to GDP-L-galactose, which includes expression of recombinant GDP- 
D-mannose GDP-L-gaiactose epimerase and/or homologues thereof, and of other 
recombinant epimerases such as GDP-4-keto-6-deoxy-D-mannose epimerase reductase 
and epimerases that share structural homology with such epimerase as discussed in detail 
5 above. 

It is to be understood that a production organism can be genetically modified by 
recombinant technology in which a nucleic acid molecule encoding a protein involved in 
the L-ascorbic acid production pathway disclosed herein is transformed into a suitable 
host which is a different member of the plant kingdom from which the nucleic acid 
10 molecule was derived. For example, it is an embodiment of the present invention that a 
recombinant nucleic acid molecule encoding a GDP-D-mannose:GDP-L-galactose 
epimerase from a higher plant can be transformed into a microalgal host in order to 
overexpress the epimerase and enhance production of L-ascorbic acid in the microalgal 
production organism. 

15 As previously discussed herein, in one embodiment, a genetically modified 

microorganism can be a microorganism in which nucleic acid molecules have been deleted, 
inserted or modified, such as by insertion, deletion, substitution, and/or inversion of 
nucleotides, in such a manner that such modifications provide the desired effect within the 
microorganism. A genetically modified microorganism is preferably modified by 

20 recombinant technology, such as by introduction of an isolated nucleic acid molecule into 
a microorganism. For example, a genetically modified microorganism can be transfected 
with a recombinant nucleic acid molecule encoding a protein of interest, such as a protein 
for which increased expression is desired. The transfected nucleic acid molecule can 
remain extrachromosomal or can integrate into one or more sites within a chromosome 

25 of the transfected (Le., recombinant) host cell in such a manner that its ability to be 
expressed is retained. Preferably, once a host cell of the present invention is transfected 
with a nucleic acid molecule, the nucleic acid molecule is integrated into the host cell 
genome. A significant advantage of integration is that the nucleic acid molecule is stably 
maintained in the cell. In a preferred embodiment, the integrated nucleic acid molecule 

30 is operatively linked to a transcription control sequence (described below) which can be 
induced to control expression of the nucleic acid molecule. 
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A nucleic acid molecule can be integrated into the genome of the host cell either 
by random or targeted integratioa Such methods of integration are known in the art For 
example, an K coli strain ATCC 47002 contains mutations that confer upon it an inability 
to maintain plasmids which contain a CotEl origin of replication. When such plasmids are 

5 transferred to this strain, selection for genetic markers contained on the plasmid results 
in integration of the plasmid into the chromosome. This strain can be transformed, for 
example, with plasmids containing the gene of interest and a selectable marker flanked 
by the 5'- and 3 'termini of the K coli lacZ gene. The lacZ sequences target the incoming 
DNA to the lacZ gene contained in the chromosome. Integration at the lacZ locus 

0 replaces the intact lacZ gene, which encodes the enzyme p-galactosidase, with a partial 
lacZ gene interrupted by the gene of interest. Successful integrants can be selected for 
p-galactosidase negativity. 

A genetically modified microorganism can also be produced by introducing nucleic 
acid molecules into a recipient cell genome by a method such as by using a transducing 

5 bacteriophage. The use of recombinant technology and transducing bacteriophage 
technology to produce several different genetically modified microorganism of the present 
invention is known in the art. 

According to the present invention, a gene, for example the GDP-D- 
mannose:GDP-L-galactose epimerase gene, includes all nucleic acid sequences related to 

0 a natural epimerase gene such as regulatory regions that control production of the 
epimerase protein encoded by that gene (such as, but not limited to, transcription, 
translation or post-translation control regions) as well as the coding region itself In 
another embodiment, a gene, for example the GDP-D-mannose:GDP-L-galactose 
epimerase gene, can be an allelic variant that includes a similar but not identical sequence 

5 to the nucleic acid sequence encoding a given GDP-D-mannose:GDP-L-galactose 
epimerase gene. An allelic variant of a GDP-D-mannose:GDP-L-galactose epimerase 
gene which has a given nucleic acid sequence is a gene that occurs at essentially the same 
locus (or loci) in the genome as the gene having the given nucleic acid sequence, but 
which, due to natural variations caused by, for example, mutation or recombination, has 

0 a similar but not identical sequence. Allelic variants typically encode proteins having 
similar activity to that of the protein encoded by the gene to which they are being 
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compared. Allelic variants can also comprise alterations in the 5' or 3' untranslated 
regions of the gene (e.g., in regulatory control regions). Allelic variants are well known 
to those skilled in the art and would be expected to be found within a given 
microorganism or plant and/or among a group of two or more microorganisms or plants. 
5 In accordance with the present invention, an isolated nucleic acid molecule is a 

nucleic acid molecule that has been removed from its natural milieu (i.e., that has been 
subject to human manipulation). As such, "isolated" does not reflect the extent to which 
the nucleic acid molecule has been purified. An isolated nucleic acid molecule can include 
DNA, RNA, or derivatives of either DNA or RNA. There is no limit, other than a 
10 practical limit, on the maximal size of a nucleic acid molecule in that the nucleic acid 
molecule can include a portion of a gene, an entire gene, or multiple genes, or portions 
thereof. 

An isolated nucleic acid molecule of the present invention can be obtained from 
its natural source either as an entire (Le., complete) gene or a portion thereof capable of 

15 forming a stable hybrid with that gene. An isolated nucleic acid molecule can also be 
produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) 
amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules include 
natural nucleic acid molecules and homologies thereof, including, but not limited to, 
natural allelic variants and modified nucleic acid molecules in which nucleotides have been 

20 inserted, deleted, substituted, and/or inverted in such a manner that such modifications 
provide the desired effect within the microorganism. A structural homologue of a nucleic 
acid sequence has been described in detail above. Preferably, a homologue of a nucleic 
acid sequence encodes a protein which has an amino acid sequence that is sufficiently 
similar to the natural protein amino acid sequence that a nucleic acid sequence encoding 

25 the homologue is capable of hybridizing under stringent conditions to (i.e., with) a nucleic 
acid molecule encoding the natural protein (i.e., to the complement of the nucleic acid 
strand encoding the natural protein amino acid sequence). A nucleic acid molecule 
homologue encodes a protein homologue. As used herein, a homologue protein includes 
proteins in which amino acids have been deleted (e.g., a truncated version of the protein, 

30 such as a peptide), inserted, inverted, substituted and/or derivatized (e.g., by 
glycosylation, phosphorylation, acetytation, myristoylation, prenylation, palmitation, 
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amidation and/or addition of glycosylphosphatidyl inositol) in such a manner that such 
modifications provide the desired effect on the protein and/or within the microorganism 
(e.g., increased or decreased action of the protein). 

A nucleic acid molecule homologue can be produced using a number of methods 
5 known to those skilled in the art (see, for example, Sambrook et al., ibid). For example, 
nucleic acid molecules can be modified using a variety of techniques including, but not 
limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site- 
directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, 
restriction enzyme cleavage of a nucleic acid fragment, ligation of nucleic acid fragments, 

10 PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, 
synthesis of oligonucleotide mixtures and ligation of mixture groups to "build" a mixture 
of nucleic acid molecules and combinations thereof. Nucleic acid molecule homologues 
can be selected from a mixture of modified nucleic acids by screening for the function of 
the protein encoded by the nucleic acid and/or by hybridization with a wild-type gene. 

1 5 Although the phrase "nucleic acid molecule" primarily refers to the physical nucleic 

acid molecule and the phrase "nucleic acid sequence" primarily refers to the sequence of 
nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, 
especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being 
capable of encoding a gene involved in an L-ascorbic acid production pathway. 

20 Knowing the nucleic acid sequences of certain nucleic acid molecules of the 

present invention allows one skilled in the art to, for example, (a) make copies of those 
nucleic acid molecules and/or (b) obtain nucleic acid molecules including at least a portion 
of such nucleic acid molecules (e.g., nucleic acid molecules including full-length genes, 
full-length coding regions, regulatory control sequences, truncated coding regions). Such 

25 nucleic acid molecules can be obtained in a variety of ways including traditional cloning 
techniques using oligonucleotide probes to screen appropriate libraries or DNA and PCR 
amplification of appropriate libraries or DNA using oligonucleotide primers. Preferred 
libraries to screen or from which to amplify nucleic acid molecule include bacterial and 
yeast genomic DNA libraries, and in particular, microalgal genomic DNA libraries. 

30 Techniques to clone and amplify genes are disclosed, for example, in Sambrook et al., 
ibid 
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The present invention includes a recombinant vector, which includes at least one 
isolated nucleic acid molecule of the present invention, inserted into any vector capable 
of delivering the nucleic acid molecule into a host microorganism of the present invention. 
Such a vector can contain nucleic acid sequences that are not naturally found adjacent to 
5 the isolated nucleic acid molecules to be inserted into the vector. The vector can be either 
RNA or DNA and typically is a plasmid Recombinant vectors can be used in the cloning, 
sequencing, and/or otherwise manipulating of nucleic acid molecules. One type of 
recombinant vector, referred to herein as a recombinant molecule and described in more 
detail below, can be used in the expression of nucleic acid molecules. Preferred 

10 recombinant vectors are capable of replicating in a transformed bacterial cells, yeast cells, 
and in particular, in microalgal cells. 

Transformation of a nucleic acid molecule into a cell can be accomplished by any 
method by which a nucleic acid molecule can be inserted into the cell. Transformation 
techniques include, but are not limited to, transfection, electroporation, microinjection and 

15 bioli sties 

A recombinant cell is preferably produced by transforming a host cell with one or 
more recombinant molecules, each comprising one or more nucleic acid molecules 
operatively linked to an expression vector containing one or more transcription control 
sequences. The phrase, operatively linked, refers to insertion of a nucleic acid molecule 

20 into an expression vector in a manner such that the molecule is able to be expressed when 
transformed into a host cell. As used herein, an expression vector is a DNA or RNA 
vector that is capable of transforming a host cell and of effecting expression of a specified 
nucleic acid molecule. Preferably, the expression vector is also capable of replicating 
within the host cell. In the present invention, expression vectors are typically plasmids. 

25 Expression vectors of the present invention include any vectors that function (i.e., direct 
gene expression) in a yeast host cell, a bacterial host cell, and preferably a microalgal host 
cell. 

Nucleic acid molecules of the present invention can be operatively linked to 
expression vectors containing regulatory sequences such as transcription control 
30 sequences, translation control sequences, origins of replication, and other regulatory 
sequences that are compatible with the recombinant cell and that control the expression 
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of nucleic acid molecules of the present invention. In particular, recombinant molecules 
of the present invention include transcription control sequences. Transcription control 
sequences are sequences which control the initiation, elongation, and termination of 
transcription. Particularly important transcription control sequences are those which 
5 control transcription initiation, such as promoter, enhancer, operator and repressor 
sequences. Suitable transcription control sequences include any transcription control 
sequence that can function in yeast or bacterial cells or preferably, in microalgal cells. A 
variety of such transcription control sequences are known to those skilled in the art. 

It may be appreciated by one skilled in the art that use of recombinant DNA 

10 technologies can improve expression of transformed nucleic acid molecules by 
manipulating, for example, the number of copies of the nucleic acid molecules within a 
host cell, the efficiency with which those nucleic acid molecules are transcribed, the 
efficiency with which the resultant transcripts are translated, and the efficiency of post- 
translational modifications. Recombinant techniques useful for increasing the expression 

15 of nucleic acid molecules of the present invention include, but are not limited to, 
operatively linking nucleic acid molecules to high-copy number plasmids, integration of 
the nucleic acid molecules into the host cell chromosome, addition of vector stability 
sequences to plasmids, substitutions or modifications of transcription control signals (e.g., 
promoters, operators, enhancers), substitutions or modifications of translational control 

20 signals, modification of nucleic acid molecules of the present invention to correspond to 
the codon usage of the host cell, deletion of sequences that destabilize transcripts, and use 
of control signals that temporally separate recombinant cell growth from recombinant 
enzyme production during fermentation. The activity of an expressed recombinant protein 
of the present invention may be improved by fragmenting, modifying, or derivatizing 

25 nucleic acid molecules encoding such a protein. 

The following experimental results are provided for the purposes of illustration and 
are not intended to limit the scope of the invention. 
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EXAMPLES 

present example describes the elucidation of the pathway from glucose to L- 
d through GDP-D-mannose in plants. 

e the present inventors have previously shown that Prototheca makes L- 
1 (AA) from glucose, it was worthwhile to examine cultures for some of the 
sion products of glucose. In the past, the present inventors had concentrated 
. from glucose to organic acids, based on the published pathway of L-ascorbic 
sis in animals and proposed pathways in plants. The present inventors 
herein that the pathway from glucose to L-ascorbic acid involves not organic 
ither sugar phosphates and nucleotide diphosphate sugars (NDP-sugars). 
r to the present invention, it was known that all cells synthesize 
des by first forming NDP-sugars . The sugar moiety is then incorporated into 
ile the cleaved NDP is recycled. A variety of polysaccharides are known, and 
amed based on the relative proportions of the sugar residues in the polymers, 
e, a "galactomannan" contains mostly galactose, and to a lesser degree, 
sidues. The "biopolymer" from Prototheca strains isolated by the present 
as analyzed and found to be 80% E>-galactose, 18% rhamnose (D- or L- 
>n not determined), and 2% L-arabinose. The present inventors provide 
jrein of how the respective NDP-sugars that make up the Prototheca 
ire formed, and what correlations exist between L-ascorbic acid synthesis and 
>n of the NDP-sugar forms of the sugar residues found in the biopolymer. 
common NDP-sugar UDP-glucose is shown in Fig. 2B. This is formed in 
glucose-l-P by the action of UDP-D-ghicose pyrophosphorylase. UDP- 
t>e epimerized in plants to form UDP-D-galactose, using UDP-D-glucose-4- 
LTDP-D-galactose can also be formed by phosphorylation of D-galactose by 
te to form D-galactose-E-P, which can be converted to UDP-D-galactose by 
ctose pyrophosphorylase. These known routes were believed to account for 
ose in the Protothecu biopolymer. The UDP-L-arabinose can be formed by 
ions beginning with the oxidation of UDP-D-ghicose to UDP-D-glucuronic 
DP-D-glucose dehydrogenase), decarboxylation to UDP-D-xylose, and 
►n to UDP-L-arabinose. This accounts for the arabinose residues in the 
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biopolymer. UDP-L-rhamnose is known to be formed from UDP-D-glucose, thus all 
three of the sugar moieties in the Prototheca biopolymer can be accounted for by a 
pathway through glucose- 1-P and UDP-glucose Alternatively, if the rhamnose in the 
biopolymer is D-rhamnose, it is not formed via UDP-D-glucose, but by oxidation of GDP- 
5 D-mannose (See Fig. 1). 

GDP-D-rhamnose is formed by converting glucose, in turn, to D-glucose-6-P, D- 
fructose-6-P, D-mannose-6-P, D-mannose-l-P, GDP-D-mannose, and GDP-D-rhamnose. 
It was of interest to the present inventors that this route passes through GDP-D-mannose. 
Exogenous marmose is known to be converted to D-mannose-6-P in plants, and can enter 
10 the path above. D-mannose is converted to L-ascorbic acid by Prototheca cells cultured 
by the present inventors as well or better than glucose (see Example 4). The mechanism 
of conversion, in Chlorella pyrenoidosa , of GDP-D-mannose to GDP-L-galactose by 
GDP-D-mannose : GDP-L-galactose epimerase, has been known for years (See, Barber, 
1971, Arch. Biochem. Biophys. 147:619-623, incorporated herein by reference in its 
15 entirety). The present inventors have discovered herein that L-galactose and L-galactono- 
y -lactone are rapidly converted to L-ascorbic acid by strains of Prototheca and Chlorella 
pyrenoidosa. Prior to the present invention, it was known that L-galactono-v -lactone is 
converted to L-ascorbic acid in several plant systems, but the synthesis steps prior to this 
step were unknown. Based on the published literature and the present experimental 
20 evidence, the present inventors have determined that the L-ascorbic acid biosynthetic 
pathway in plants passes through GDP-D-mannose and involves sugar phosphates and 
NDP-sugars. The proposed pathway is shown in Fig. 1. Salient points relevant to the 
design and production of genetically modified microorganisms useful in the present 
method include: 

1 . The enzymes leading from D-gtucose to D-fhictose-6-P are well known 
enzymes in the first, uncommitted steps of glycolysis. 

2. The enzymes involved in the conversion of D-fructose-6-P to GDP-D- 
mannose have been well characterized in plants, yeast, and bacteria, particularly 
Azotobacter vinelandii and Pseudomonas aeruginosa, which convert GDP-D-mannose 
to GDP-D-mannuronic acid, which is the precursor for alginate (See for example, Sa- 
Correia et al., 1987, J. Bacteriol. 169:3224-3231; Koplin et al., 1992, J. Bacteriol. 
174:191-199; Oesterhek et al., 1996, Plant Science 121:19-27; Feingold et al., 1980 The 
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fij fK-Jigmistrv nf Plants : Vol 3: Carbohydrates, structure and function, P K. Stampf & E E. 
Conn, eds., Academic Press, New York, pp. 101-170; Smith etal., 1992,M>/. Cell Biol 
12:2924-2930; Boles et al., 1994, Eur. 1 Med 220:83-96; Hashimoto et al., 1997, J. 
Biol Chem. 272:16308-16314, all of which are incorporated herein by reference in their 
5 entirety). 

3. Barber (1971, supra, and 1975) identified in Chlorella pyrenoidosa the 
enzyme activities for the conversion of GDP-D-mannose to GDP-L-galactose and L- 
galactose-l-P. 

4. The present inventors have shown herein the rapid conversion of L- 
1 0 galactose and L-galactono-y -lactone to L-ascorbic acid by Prototheca cells. 

5. L-galactono-y-lactone and L-galactonic acid can be interconverted in 
solution by changing the pH of the solution; addition of base shifts the equilibrium to L- 
galactonic acid, while addition of acid shifts the equifflnium to the lactone. Cells may have 
an enzymatic means for this conversion in addition to this non-enzymatic route. 

15 6. In plants, GDP-L-fucose is also formed from GDP-D-mannose, presumably 

for incorporation into polysaccharide. Roberts (1971) fed labeled D-mannose to corn root 
tips and found the label in polysaccharide, specifically in the residues of D-mannose, L- 
galactose, and L-fiicose. No label was detected in D-glucose, D-galactose, L-arabinose, 
or D-xylose Prototheca and C. pyrenoidosa cells have the ability to convert L-fucose 

20 (6-deoxy-L-galactose) to a dipyridyl-poshive product that was shown by HPLC not to be 
L-ascorbic acid. The present inventors believe that it is was the 6-deoxy analog of L- 
ascorbic acid. 



Example 2 

This example shows that in Prototheca, like other plants (Loewus, FA 1988. In: 
25 J. Priess (ed), The Biochemistry of Plants, 14:85-107. New York, Academic Press) and 
the green microalga Chlorella pyrenoidosa (Renstrom, et al '., 1983. Plant Sci. Lett. 
28:299-305), ascorbic acid (AA) production from glucose proceeds by a biosynthetic 
pathway that allows retention of the configuration of the carbon skeleton of glucose. 

Cultures of the strain UV77-247 were grown to moderate cell density in shake 
30 flasks with l-"C-labeled glucose as 10% of the total glucose (40 g/L). Incubation was 
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as per the standard Mg-limited screen (see Example 3). The culture supernates were 
clarified, deionized to remove salts, lyophilized, and subjected to nuclear magnetic 
resonance (nmr) analysis to determine where in the AA molecule the 13 C was located. In 
each case, approximately 85% of the label was found at the C-l position of A A, with most 
5 of the remaining label at the C-6 position. This strongly indicated that AA is synthesized 
from glucose by a pathway that retains the carbon chain configuration, i. e., C-l of 
glucose becomes C-l of AA This has typically been observed in plants (Loewus, FA. 
1988. Ascorbic acid and its metabolic products. In: The Biochemistry of Plants, ed. J. 
Priess, 14:85-107. New York, Academic Press). Animals (Mapson, L.W. and F.A. 

10 Isherwood 1956. Biochem. J. 64:151-157; Loewus, F.A. 1960. J. Biol. Chem. 
235(4):937-939) and protists such as Euglena (Shigeoka, S., et aL, 1979. J. Nutr. Sci. 
Vitaminol. 25:299-307), on the other hand, synthesize AA by a pathway that involves the 
inversion of configuration, /. e., C-l of glucose becomes C-6 of AA Demonstration of 
the inversion/non-inversion nature of the pathway was an important step in determining 

15 the pathway of AA biosynthesis since the two types of pathways require different types 
of enzymatic reactions. The label found at C-6 of AA is thought to be due to metabolism 
of glucose and subsequent gluconeogenesis. The metabolism of glucose in glycolysis 
proceeds through triose-phosphate intermediates. After this, the C-l and C-6 carbons of 
glucose become biochemically equivalent. Hexose phosphates can be regenerated from 

20 the triose phosphates by gluconeogenesis, which is essentially a reversal of the degradative 
pathway. Consequently, metabolism of C-l -labeled glucose to triose phosphates with 
subsequent gluconeogenesis would result in the formation of hexose phosphate molecules 
labeled at either or both C-l and C-6. If those hexose phosphates were precursors to AA, 
one would expect the AA to be similarly labeled. Consistent with this type of "isotopic 

25 mixing" is the observation that sucrose obtained from l- 13 C-labeled glucose was labeled 
at positions 1, 6, 1' and 6*. 

Glucose can also be metabolized by the pentose phosphate pathway, the overall 
balanced equation for which is: 

3 Glucoses-phosphate - ♦ 2 Fructoses-phosphate + Gtyceraktehyde-3- phosphate + 3 C0 2 
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Based on the known biochemistry, it would then be expected that the label at each 
of the carbons in glucose (Table 1 left column) would appear at the positions for the other 
molecules shown, and that these patterns would be reflected in the AA formed from C-2- 
and C-3-labeled glucose. 

5 TABLE 1 

Predicted Carbon Labeling of Metabolites of Glucose in the Pentose Phosphate Pathway 



Labeled Glucose 


Position of Labeled Carbon in: 


Carbon 


co 2 


F6P(t) 


F6P(2) 


G3P 


1 


+ 








2 




1.3 


1 




3 




2 


2,3 




4 




4 


4 


1 


5 




5 


5 


2 


6 




6 


6 


3 



1 5 AA recovered from cultures fed glucose labeled at C-2 or C-3 was also analyzed 

for its labeling patterns (Table 2). 



TABLE 2 

Labeling Pattern in AA after Cells were Fed 2- 13 C and 3- 13 C-glucose 



20 



Carbon 
Position in AA 


Isotonic enhancement after growth on: 


C-2 labeled glucose 


C-3 labeled glucose 


1 


1.0 


0.4 


2 


10.0 


0.9 


3 


0.5 


9.9 


4 


0 


2.8 


5 


22 


02 


6 


0 


0 



The data above again suggest a pathway from glucose to AA that proceeds by 
retention of configuration. As in the experiments with C-l labeled glucose, approximately 
one-fifth of the label is present in "mirror image 1 * position to the glucose label (C-5 for C-2 
30 labeled glucose and C-4 for C-3 labeled glucose), indicating levels of gluconeogenesis 
consistent with those previously observed. 

The small, but significant amount of enhancement observed in other positions is 
consistent with flux through the pentose phosphate pathway. As predicted above, carbon 
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flux through this pathway would result in isotopic enhancement at positions 1 and 3 when 
cells were grown on 2- 13 C glucose and enhancement at position 2 when cells were grown 
on 3- 13 C glucose. This is indeed observed. That there is twice as much enhancement at 
C-l as there is at C-3 after growth on 2- 13 C glucose is also predicted. These data indicate 
5 a small but measurable amount of carbon flux through the pentose phosphate pathway. 



Example 3 

This example shows the methods for generating, screening and isolating mutants 
of Prototheca with altered AA productivities compared to the starting strain ATCC 
75669. 

10 ATCC No. 75669, identified as Prototheca mohformis RSP1385 (unicellular 

green microalga), was deposited on February 8, 1994, with the American Type Culture 
Collection (ATCC), Rockville, Maryland, 20852, USA, under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose 
of Patent Procedure. Initial screening of Prototheca species and strains was reported in 

15 U.S. Patent No. 5,900,370, ibid. Table 3 lists the formulations of the media for growth 
and maintenance of the strains. Glucose for fermentors was supplied as glucose 
monohydrate and calculated on an anhydrous basis. The recipe for the trace metals 
solution is given in Table 4. The standard growth temperature was 35 °C. All organisms 
were cultured axenically. 



20 TABLE 3 

Media for Growth and Maintenance of Prototheca Strains 
All quantities are in g/L unless otherwise specified 



Ingredient 


Liquid 


Agar 


Standard 


Mg-limiting 


Slants 


Ferrozine 
Plates 


Standard 
Plates 


Potassium phosphate 
monobasic 


1.3 


1.3 


2.0 


0.27 


2.0 


Potassium phosphate dibasic 


3.8 


3.8 


2.0 


1.4 


2.0 


Trisodtum citrate dthydrate 


7.7 


7.7 


2.6 


1.3 


2.6 


Magnesium sulfate 
heptahydrate 


0.40 


0.02 


0.4 


0.01 


0.4 


Ammonium sulfate 


3.7 


3.7 


1.0 


1.0 


1.0 


Trace Metals Solution 


2mL 


2mL 


2mL 


2mL 


2mL 


Ferrous sulfate heptahydrate 


1.5 mg 


4.5 mg 


1.5 mg 




1.5 mg 


Calcium chloride dihydrate 




0.25 
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IrvirarliAnt 


Liquid 


Agar 


Standard 


Mg-limiting 


Slants 


Ferrozine 
Plates 


Standard 
Plates 


Manga nous sulfate 
monohydrate 




0.08 








Yeast extract 






2.5 






Agar 






15 


15 
(Noble) 


15 


pH before autoclaving 


12. 


72 


72 


72 


72 


Autoclave, then add 


Copper sulfate, 
penta hydrate, 100 g/L 








2mL 




40 g/L Ferrozine in 5 mM 
phosphate (pH 7.5 final) 








8.8 mL 




Ferric ammonium sulfate 
dodeca hydrate, 40 g/L 








3.8 mL 




50% glucose with 25 mg/L 
thiamine HCI 


40 mL 


60 mL 


10 mL 


10 mL 


10 mL 



15 TABLE 4 

Trace Metals Solution 







Cone, of Individ. 


mL Indiv. Stock per 


Compound 


Molecular Weight 


Solutions, g/L 


liter of Working Stock 


Distilled Water 






823 


Hydrochloric Acid 




Cone. 


20 


Cobalt Chloride hexahydrate 


237.9 


24.0 


6.5 


Boric acid 


61.8 


38.1 


24 


Zinc sulfate heptahydrate 


287.5 


35.3 


50 


Manganous sulfate 


169.0 


24.6 


50 


monohydrate 








Sodium motybdate dihydrate 


242.0 


23.8 


2.0 


Calcium chloride dihydrate 


147.0 




11.4g 


Vanadyl sulfate dihydrate 


199.0 


10.0 


8.0 


Nickel nitrate hexahydrate 


290.8 


5.0 


8.0 


Sodium setenite 


173.0 


5.0 


8.0 



30 Mutant isolates were generated by treatment with one or more of the following 

agents: nitrous acid (NA); ethyl methane sulfonate (EMS); or ultraviolet light (UV). 
Typically, glucose-depleted cells grown in standard liquid medium were washed and 
resuspended in 25 mM phosphate buffer, pH 7.2, diluted to approximately 10 7 
c»lony-forrning units per mL (cfu/rnL), exposed to the mutagen to achieve about 99% kill, 

35 incubated 4-8 hours in the dark, and spread onto standard agar medium, or agar media 
containing differential agents. 
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Some mutant colonies on standard agar medium were picked randomly and 
subcukured to master plates. Other isolation plates were inverted over chloroform to ryse 
cells on the surface of the colonies and allow them to release AA. Released AA was 
detected by spraying the treated plates with a solution of 2,6-dichrorophenol-indophenol 
5 (1.25 g/L in 70% EtOH). The ability of AA to reduce this blue redox dye to its colorless 
form is the basis for a standard assay of AA (Omaye, et ai. t 1979. Meth. Enzymol. 
62:3-11.). Colonies derived from mutagenized cells were saved to master plates for 
further evaluation if their clear halos were significantly larger than the halos typical of the 
other mutants in that group. Other mutagenized cells were spread onto plates containing 

10 an AA detection system incorporated directly into the agar. This system is based on the 
ability of AA to reduce ferric iron to ferrous iron. The compound ferrozine 
(3-(2-pyridyl)-5,6- bis(4-phenylsulfonic acid)-l,2,4-triazine) was present in the agar to 
complex with the ferrous iron and give a violet color reaction. The ferrozine agar 
formulation is shown in Table 3. Colonies giving the darkest color reactions were 

15 master-plated. When screening for non-AA-producing strains (blocked mutants), white 
colonies were chosen against a background of relatively dark colonies. 

For primary screening of tube cultures, cells were inoculated from master plates 
into 4 mL ofMg-lirniting medium in 16 x 125 mm test tubes, and tubes were shaken in a 
slanted position on a rotary shaker at 300 rpm for four days. After both three and four 

20 days of incubation aliquots were removed for AA assay and cell density detennination. 
Those for AA assay were centrifuged at 1 500 x g for 5 min and the resulting supernates 
were removed for either colorimetric assay or high pressure liquid chromatography 
(HPLC). Promising isolates were retested in tube culture. Those passing the tube screen 
were tested in shake flasks. 

25 For secondary screening of flask cultures, cells were inoculated into 50 mL of 

standard flask medium in 250 mL baffled shake flasks, and incubated on a rotary shaker 
at 180 rpm until glucose depletion (24-48 hours). A second series of flasks of 
Mg-sufficient standard medium was inoculated from the first set to a cell density of 0. 15 
A« 20 , and incubated for 24 hours. A third series of Mg-limiting flask medium was 

30 inoculated from the second set by a 1/50 dilution and incubated for 96 hours. Flasks were 
sampled for AA analysis and cell density measurements during this time as required. 
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Aliquots for supernatant AA analysis were centrifuged at 5000 x g for 5 min. Aliquots 
for total whole broth AA analysis were first extracted for 1 5 min with an equal volume of 
5% trichloroacetic acid (TCA) before ceritrifagation. Aliquots of the resulting supernates 
were removed for either colorimetric assay or HPLC analysis. 
5 For colorimetric assay of AA, a modification of the method of Omaye, et al 

(1979. Meth. Enzymol. 62:3-1 1) was used. Twenty-five uL aliquots of culture supernates 
were added to wells of 96- well microplates, and 125 uL of color reagent was added. The 
color reagent consisted of four parts 0.5% aqueous 2,2'-dipyridyl and one part 8.3 mM 
ferric ammonium sulfate in 27 % (v/v) o-phosphoric acid, the two components being 

10 mixed immediately before use. After one hour, the absorbance at 520 nm was read. AA 
concentration was calculated by comparison of the absorbances of AA standards. 

HPLC analysis was based on that of Running, et al. y (1994). Supernates were 
chromatographed on a Bio-Rad HPX-87H organic acid column (Bio-Rad Laboratories, 
Richmond, CA) with 13 mM nitric acid as solvent, at a flow rate of 0.7 mL/min at room 

15 temperature. Detection was at either 254 nm using a Waters 441 detector (Millipore 
Corp., Milford, MA), or at 245 nm using a Waters 481 detector. This system can 
distinguish between the L- and D- isomers of AA 

For dry weight determinations of cell density, 5 mL whole broth samples were 
centrifuged at 5000 x g for 5 min, washed once with distilled water, and the pellet was 

20 washed into a tared aluminum weighing pan. Cells were dried for 8-24 h at 1 05 ° C. Cell 
weight was calculated by difference. 

Table 5 shows the abilities of various mutants of Prototheca to synthesize AA 



TABLE 5 

AA Synthesizing Ability of Various Prototheca Mutants in Flask Screen 



Strain 


Specific AA Formation, mg AA per UCutture A^, 
during Mg-limited Incubation 


2 Days Incubation 


4 Days Incubation 


ATCC 75669 


22 


35 


EMS13-4 


79 


166 


UV213-1 


0 


0 


UV218-1 


0 


0 


UV244-1 


0 


0 
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Strain 


Specific AA Formation, mg AA per UCufture A^, 
during Mg-iimitBd Incubation 


2 Days Incubation 


4 Days Incubation 


UV244-15 


58 


68 


UV77-247 


56 


83 


UV140-1 


67 


100 


UV164-6 


91 


131 


NA21-14 


27 


78 


UV82-21 


0 


0 


UV127-10 


50 


95 


SP2-3 


3 


4 



1 o The genealogy of these isolates is presented graphically in the "family tree" of Fig. 

3. ATCC No. , identified as Prototheca moriformis EMS 13 -4 (unicellular green 

microalga), was deposited on May 25, 1999, with the American Type Culture Collection 
(ATCC), 10801 University Boulevard, Manassas, VA 201 10, USA, under the terms of 
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms 

15 for the Purpose of Patent Procedure. ATCC No. , identified as Prototheca 

moriformis UV127-10 (unicellular green microalga), was deposited on May 25, 1999, 
with the American Type Culture Collection (ATCC), 10801 University Boulevard, 
Manassas, VA 201 10, USA, under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure. 

20 ATCC No. , identified as Prototheca moriformis SP2-3 (unicellular green 

microalga), was deposited on May 25, 1999, with the American Type Culture Collection 
(ATCC), 10801 University Boulevard, Manassas, VA 201 10, USA, under the terms of 
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms 
for the Purpose of Patent Procedure. 

25 Example 4 

The following example shows that both growing and resting cells of Prototheca 
can rapidly convert L-galactose and L-galactono-y-lactone to AA, and that conversion 
of D-mannose to AA by Prototheca is more rapid than conversion of D-gtucose. 

Shake flask cultures of the mutant strain UV77-247 were grown to glucose 
30 depletion in standard liquid medium (Table 3). Cells were washed twice and resuspended 
in complete medium with the glucose substituted by one of the compounds listed below. 
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Cell suspensions were incubated for 24 hours at 35° C with shaking, and the entire 
suspension was extracted with TCA as above and assayed for AA 

Tables 6-8 show that both growing and resting cells of strain UV77-247 can 
rapidly convert ^galactose and L-galactono-y-lactone to AA. In these experiments, 
D-fructose and D-galactose were converted to AA at the same rate as D-glucose, 
suggesting that they are metabolized to AA through the same route as D-glucose. None 
of the organic acids suggested in the literature to be intermediates in the biosynthesis of 
AA were converted to AA, including sorbosone, which has been proposed as an 
intermediate by Saito <?/a/.(1990 Plant Physiol. 94:1496-1500). 



10 



15 



20 



25 



TABLE 6 

Conversion of Compounds by Resting Cells of Strain UV77-247 



30 



Substrate (50 mM) 


Total AA, mg/L 


AA Relative to No 
Substrate Control 


L-galactose 


965 


623 


L-galactono-v-lactone 


818 


476 


D-fructose 


590 


248 


D-glucosone 


589 


247 


D-glucose 


584 


242 


D-galactose 


542 


200 


D-glucose (10 mM) 


388 


46 


D-glucono lactone 


382 


40 


D-gulono-v-lactone 


366 


24 


D-glucuronate 


364 


22 


L-sorbosone 


342 


0 


None 


342 


0 


2-keto-D-glucorHC acid 


341 


-1 


D-isoascorbic acid (10 mM) 


330 


-12 


D-glucuronolactone 


329 


-13 


Dgluconic acid 


309 


-33 


D-galacturonic acid 


297 


-45 


L-idonate 


296 


-46 



35 



Since strain UV77-247 converted L-galactose and L-galactono-y-lactone to AA 
much more rapidly than it did glucose, it suggests that these compounds are intermediates 
in the AA biosynthetic pathway and that they are "downstream" from glucose. 

The data in Tables 7 and 8 also show that growing and resting cells of UV77-247 
consistently convert D-mannose to AA at a rate greater than that of glucose. 
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TABLE 7 



Conversion of Compounds to AA by Resting Cells of Strain UV77-247 



Compound 


Ascorbic Acid, mg/L 


25.5 h 


30 h 


47 h 


L-galactose 


667 


718 


620 


L-galactono-y-lactone 


644 


661 


749 


D-glucosone 


465 


462 


354 


D-mannose 


446 


462 


399 


D-fructose 


402 


408 


367 


d-gkicose 


395 


404 


351 


D-galactose 


352 


361 


337 


none 


287 


288 


258 



TABLES 

Conversion of Compounds to AA by Growing Cells of Strain UV77-247 





Ascorbic Acid, mg/L 


| A«2D 


| AA/A^ 


Compound 


25.5 h 


44h 


L-galactose 


249 


506 


4.5 


112 


D-mannose 


228 


488 


5.6 


87 


L-galactono-y-lactone 


214 


342 


5.0 


68 


D-glucose 


178 


398 


5.9 


67 


D-fructose 


181 


383 


5.9 


65 


D-glucosone 


176 


362 


5.7 


64 


D-galactose 


185 


380 


5.9 


64 


none 


182 


249 


4.4 


57 


D-gluconic acid (K) 


178 


262 


5.0 


52 


L-idonate (Na) 


182 


232 


4.7 


49 


2-keto-D-gluconic acid 


182 


255 


5.3 


48 


2-deoxy-D-glucose 


181 


227 


4.8 


47 


D-g I neuronic acid lactone 


165 


218 


5.0 


44 


D-glucuronic acid (Na) 


173 


241 


5.6 


43 


L-gulono-y-Jactone 


152 


195 


5.0 


39 


L-sorbosone 


178 


160 


4.7 


34 


D-glucono-6-lactone 


130 


190 


5.7 


33 


D-galacturonic acid 


130 


180 


6.0 


30 



These cells converted L-galactose, L-galactono-y-lactone and D-mannose to AA 
more rapidly than they did glucose, suggesting that mannose exerts its effect in the 
35 biosynthetic pathway "downstream" from glucose. 
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Example 5 

Using the methods described above, a collection of mutants was assembled. The 
specific AA formation for representative mutants are shown in Table 5. The genealogy 
of these isolates is presented graphically in the "family tree" of Fig. 3. 
5 These isolates were tested for their ability to convert compounds which could be 

converted to AA by strain UV77-247. Testing was done as in Example 4. Results are 
shown in Table 9. 

TABLE 9 

Conversion of Compounds to AA by Resting Cells 
10 of Mutant Strains of Prototheca of Varying Abilities to Synthesize AA 



Strain 


Absolute AA, mg/L 


Buffer 


Glucose 


L-ga lactose 


L-gal^y-lact. 


Mannose 


Fructose 


EMS13-4 


53 


97 


191 


173 


139 


ND 


UV1 27-10 


45 


140 


213 


140 


128 


143 


SP2-3 


19 


19 


204 


146 


24 


27 


NA21-14 


61 


80 


147 


158 


118 


115 


UVB2-21 


15 


16 


183 


175 


18 


17 


UV213-1 


16 


15 


170 


135 


17 


16 


UV218-1 


16 


18 


136 


176 


19 


21 


UV244-1 


16 


16 


164 


162 


16 


16 


UV244-15 


26 


77 


30 


21 


94 


89 


UV244-16 


28 


64 


53 


53 


53 


66 



ND = Not Determined 



These data suggest that the mutational blocks in those strains which convert 
fructose and mannose to AA poorly are before ("upstream'' from) L-galactose and 
25 L-galactono-Y-lactone in the pathway. 

Example 6 

The following example shows that magnesium inhibits early steps in the production 

of AA 

To address the question of whether magnesium actually inhibits AA synthesis, 
30 strain NA45-3 (ATCC 20968 1 ) was grown in magnesium (Mg)-limited and Mg-sufficient 
medium. ATCC No. 209681, identified as Prototheca moriformis NA45-3 (Source: 
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repeated mutagenesis of ATCC No. 75669; Eucaryotic alga. Division Chlorophyta, Class 
Chlorophyceae, Order Chlorococcales), was deposited on March 13, 1998, with the 
American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 
201 10, USA, under the terms of the Budapest Treaty on the International Recognition of 
5 the Deposit of Microorganisms for the Purpose of Patent Procedure. Cells from both 
cultures were harvested and resuspended in the cell-free supernate from the Mg-limited 
culture, and to half of each cell suspension additional magnesium was added in order to 
bring the level in the suspension to the Mg-sufficient level. The four conditions under 
which assays were run were as follows. 

10 TABLE 10 

Conditions Used to Test the Effect of Magnesium on AA Production 



Condition 


Magnesium concentration, g/L, during: 


Growth 


Assay 


1Mg>1Mg 


0.02 


0.02 


1Mg>10Mg 


0.02 


02 


10Mg>1Mg 


02 


0.02 


10Mg>10Mg 


02 


0.2 



Substrates previously shown to lead to the formation of AA, namely D-glucose, 
D-glucosone, D-fructose. D-galactose, D-mannose, and L-galactono-y -lactone, were 
added at 20 g/L to the four cell suspensions. Accumulation of AA after 24 hours was 

20 measured and compared to a control in which no substrate was added The results of this 
study are shown graphically in Fig. 4. 

When cells growing under magnesium-limited conditions were incubated with 
substrates in low-magnesium broth (lMg>lMg condition), all showed significant and 
similar accumulation of AA over the control condition. When the same cells were 

25 incubated in high magnesium broth (lMg>10Mg condition), the accumulation of AA was 
reduced about 40% for all substrates except D-mannose and L-galactono-y-lactone, 
suggesting that 1) the rate-limiting step in the conversion of D-glucose, D-glucosone, 
D-fructose, and D-galactose to AA is inhibited by magnesium or 2) magnesium stimulates 
an enzyme which results in the conversion of these compounds to some other 

30 compound(s), reducing the amount of substrate available for AA synthesis. On the other 
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hand, conversion ofD-mannose and L-galactono-y-lactone appeared to be unaffected by 
the presence of magnesium in the resuspension buffer, indicating that either 1) 
magnesium-inhibited enzymes are not involved in the conversion of these substrates to AA 
or 2) D-mannose and L-galactono-y-lactone enter the pathway far enough downstream 
5 from the point where they can be siphoned off by side reactions involving Mg-requiring 
enzymes. 

When cells were grown under magnesium-sufficient conditions, very little AA 
accumulation from any of the D-sugars was observed, regardless of the level of 
magnesium in the resuspension broth. Accumulation of AA from L-galactono-y-lactone, 

10 however, was enhanced over that observed when cells are grown in Mg-limited 
conditions. This suggests that enzymes early in the pathway are repressed under 
Mg-sufficient conditions. Thus, the D-substrates all behaved similarly, with the exception 
of the apparent lack of magnesium inhibition of D-mannose conversion to AA. This 
would suggest that D-mannose enters the AA biosynthetic pathway at a point other than 

1 5 the other D-sugars. 

Figs. 2A and 2B represent some of the fates of glucose in plants. The first 
enzymatic step in this scheme which commits carbon to glycolysis is the conversion of 
fructose- 6-P to fructose- 1,6-diP by phosphofructokinase (PFK). This reaction is 
essentially irreversible, and leads to the well known TCA cycle and oxidative 

20 phosphorylation, with concomitant ATP and NADH/NADPH generation. PFK has an 
absolute requirement for magnesium. If magnesium is limiting, this reaction could slow 
and eventually stop, blocking the flow of carbon through glycolysis and beyond, and 
would result in cessation of cell division even in the presence of excess glucose. One 
would expect fructose-6-P to accumulate under these conditions, fueling AA synthesis by 

25 the pathway shown in Figs. 1 and 2. 



Example 7 

The following example shows the correlation in Prototheca between AA 
production and the activity levels of the enzymes in the AA pathway. 
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Phosphomannose isomerase (PIvTO Assay 

PMI activity was first assayed (See Fig. 1). Ten strains representing a range of 
AA productivities were grown according to the standard protocol to measure 
AA-synthesizing ability. Cells were harvested 96 hours into magnesium-limited 
5 incubation, washed and resuspended in buffer containing 50 mM Tris/10 mM MgCl^ pH 
7.5. The suspended cells were broken in a French press, spun at 30,000 x g for 30 
minutes, and desalted through Sephadex G-25 (Pharmacia PD-10 columns). Reactions 
were carried out in the reverse direction by adding various volumes of extracts to 
solutions of Tris/Mg buffer containing 0.15 U phosphoglucose isomerase (EC 5.3.1.9), 

10 0.5 U gIucose-6-phosphatc dehydrogenase (EC 1.1.1 .49), and 1 .0 mM NADP. Reactions 
were initiated by addition of 3 mM (final) mannose-6-phosphate Final reaction volume 
was 1 .0 mL. All components were dissolved in Tris/Mg buffer. Activities were taken as 
the change in A^min. From these activities was subtracted the activities measured in 
identical reaction mixtures lacking the M-6-P substrate. Specific activities were calculated 

15 by normalizing the activities for protein concentration in the reactions. Protein in the 
original extracts was determined by the method of Bradford, using a kit from Bio-Rad 
Laboratories (Hercules, CA). All enzymes and nucleotides were purchased from Sigma 
Chemical Co. (St. Louis, MO). 

Phosphomannomutase (PMM) Assay 
20 Phosphomannomutase was measured in a similar manner in the same strains, but 

these assay reaction mixtures also contained 0.25 mM glucose- 1,6-diphosphate, 0.5 U 
commercially available PMI, and the reactions were started with the addition of 3 .0 mM 
(final) mannose- 1 -phosphate rather than mannose-6-phosphate. 

Phosphofhictpkinase (PFK) Assay 
25 To shed light on the possibility that the enhancement of AA concentration in 

cultures which were limited for magnesium was due to a diversion of carbon from normal 
metabolism by a reduced activity of the first committed step in glycolysis (PFK) the strains 
were also assayed to confirm the presence of this enzyme activity. Cells were cultured, 
washed and broken as above. Extracts were centrifuged at 1 00,000 x g for 90 min before 



WO 99/64618 PCT7US99/1 1 576 

69 

desalting. Reactions were carried out in the forward direction by adding various volumes 
of extracts to solutions of Tris/Mg buffer containing 1.5 mM dithiothreitol, 0.86 U 
aldolase (EC 4.1.2.13), 1.4 U a-grycerophosphate dehydrogenase (EC 1.1.1.8), 14 U 
triosephosphate isomerase (EC 5.3.1.1), 0.1 1 mM NADH, and l.OmMATP. Reactions 
5 were initiated by addition of 5 mM (final) fructose-6-phosphate. Final reaction volume 
was 1 .0 mL AD components were dissolved in Tris/Mg buffer. Activities were taken as 
the change in Aj^min. From these activities were subtracted the activities measured in 
identical reaction mixtures lacking the F-6-P substrate. Specific activities were calculated 
by normalizing the activities for protein concentration in the reaction. Protein in the 
10 original extracts was determined as above. 

GDP-D-mannose pyrophosphorylase ( GMP) Assay 

These same mutant strains were assayed for the next enzyme in the proposed 
pathway, GMP. Strains were grown both according to the standard Mg-limiting protocol 
(harvested 43-48 hours into magnesium-limited incubation) and in standard Mg-sufficient 

15 medium (harvesting all cells before glucose depletion). Washed cell pellets were 
resuspended in 50 mM phosphate buffer, pH 7.0, containing 20% (v/v) glycerol and 0. 1 
M sodium chloride (3 mL buffer/g wet cells), and broken in a French press. Crude extracts 
were spun at 15,000 x g for 15 minutes. Reactions were carried out in the forward 
direction by adding various volumes of extracts to solutions of 50 mM phosphate/4 mM 

20 MgClj buffer, pH 7.0, containing 1 mM GTP. Reactions were initiated by addition of 1 
mM (final) mannose- 1 -phosphate. Final reaction volume was 0. 1 mL. Reaction mixtures 
were incubated at 30 C for 10 min, filtered through a 0.45 urn PVDF syringe filter, and 
analyzed for GDP-mannose by HPLC. A Supelcosil SAX1 column (4.6 x 250 mm) was 
used with a solvent gradient (1 mlimin) of: A - 6 mM potassium phosphate, pH 3.6; B 

25 - 500 mM potassium phosphate, pH 4.5. The gradient was: 0-3 min, 100% A; 3-10 min, 
79% A; 10-15 min, 29% A. Column temperature was 30 C. Two assays that showed 
enzyme activity proportional to the amount of protein were averaged. Control 
no-substrate and no-extract reactions were also run. Specific activity was calculated by 
normalizing the activity for protein concentration in the reaction. Protein in the original 

30 extracts was determined as above. 
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GPP-D-mannose:GDP-L-gaiactose Epimerase Assay 

Further tests measured the activities of the next enzyme in the proposed pathway, 
GDP-D-mannose:GDP-L-galactose epimerase. Strains were grown according to the 
standard protocol, harvested 43-48 hours into rnagriesiuni-hrnited incubation, washed, and 
5 resuspended in buffer containing 50 mM MOPS/5 mM EDTA, pH 7.2. Washed pellets 
were broken in a French press, and spun at 20,000 x g for 20 min. Protein determinations 
were made as above and a dilution series of each was made, ranging from 0.4 to 2.2 mg 
protein/mL. 50 uL aliquots of these dilutions were added to 10 \iL aliquots of 6.3 mM 
GDP-D-mannose in which a portion of this substrate was universally labeled with U C in 

10 the mannose moiety. This substrate had an activity of 16 uCi/mL before dilution into the 
reaction mixture. Reactions were stopped after 10 min by transferring 20 uL of the 
mixture into microfuge tubes containing 20 uL of 250 mM trifluoroacetic acid (TFA) 
containing 1.0 g/L each D-mannose and L-galactose. These tubes were sealed and boiled 
for 10 min, cooled, spun for 60 sec in a Beckman Microfuge E, and 5 uL of each 

1 5 hydrorysate was spotted on 20 x 20 cm plastic-backed EM Science Silica gel 60 thin-layer 
chromatography plates (#5748/7), with 1 cm lanes created by scoring with a blunt stylus. 
After drying, plates were twice chromatographed for 2.5 hours in ethyl 
acetate:isopropanol:water, 65:22.3:12.7 (plates were dried between runs). Spots of free 
sugars were visualized by spraying dried plates with 0.5% p-anisaldehyde in a 62% 

20 ethanolic solution of 0.89 M sulfuric acid and 0. 17 mM glacial acetic acid, and heating at 
105 C for about 15 min. Spots of L-galactose and D-mannose were cut from the plates 
and counted in a scintillation counter (Beckman model 2800). For time-zero control 
counts, 16.7 uL of each extract dilution was added to 23.3 uL of the labeled substrate 
above, which had been diluted 1:7 with the TFA/mannose/galactose solution. 

25 Table 1 1 summarizes the results of the five enzyme assays for the strains tested, 

along with their specific AA formations. 
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TABLE 11 

Specific Enzyme Activities (mU)* of Selected Mutant Pmtotheca Strains 



5 



10 



Strain 


AA Soecific 
Form, mg/g 


PMI 


PMM 


PFK 


GMP 


Epimerase 


Mg- 
Mmited 


Mg- 
suffictent 


UV164-6 


78.4 












0.79 


EMS13-4 


73.7 


10.8 


69.6 


13.5 


2.6 


6.8 


0.78 


UV140-1 


69.9 












0.78 


NA45-3 


61.4 












0.58 


UV77-247 


44.4 












0.52 


UV1 27-10 


40.1 


11.1 


45.8 


24.4 


4.3 


5.9 


0.39 


UV244-15 


24.5 


14.3 


41.5 




3.1 


5.3 


0.42 


NA21-14 


23.6 


12.1 


60.3 


47.4 


2.4 


7.6 


0.27 


ATCC 75669 


21.9 












028 


UV244-16 


5.0 


16.5 


85.6 




4.3 


5.2 




SP2-3 


2.0 


17.7 


47.0 


64.5 


2.0 


7.5 


0.03 


UV218-1 


0.4 


15.9 


72.1 




2.7 


7.0 


0.83 


UV213-1 


0.1 


19.7 


47.7 


32.6 


Z2 


6.7 


0.60 


UV82-21 


0.0 


14.6 


70.6 


30.4 


4.1 


7.5 


0.15 


UV244-1 


0.0 


18.2 


51.1 




5.5 


12 


0.15 



Unite: PMI and PMM, nmotes NADP reduced per min/mg protein; PFK, nmoles IMADH oxkized per 
20 min/mg protein; GMP, nmotes GDP-D-mannose formed per min/mg protein; epimerase, nmoles 

GDP-L-gaJactose formed per min/mg protein. 

The only enzyme which showed a strong correlation between activity and the 
ability to synthesize AA was the GDP-D-mannose:GDP-L-galactose epimerase. This 
correlation is depicted in Fig. 5. All of the strains which produced measurable amounts 

25 of AA had measurable amounts of epimerase activity. The converse was not true: four 
of the strains which synthesize little or no AA had significant epimerase activities. These 
strains are candidates for having mutations which affect enzymatic steps downstream from 
the epimerase. Since all of the strains tested can synthesize AA from L-galactose and 
L-galactono- y -lactone (see Examples 4 and 5), the genetic lesion(s) in these four mutants 

30 must lie between GDP-L-galactose and free L-galactose. 
Example g 

The next example shows the relationship between GDP-D-mannose: GDP-L- 
galactose epimerase activity and the degree of magnesium limitation in two strains, the 
original unmutagenized parent strain ATCC 75669, and one of the best AA producers, 
35 EMS13-4(ATCC ). 



WO 99/646 1 8 PCI7US99/1 1 576 

72 

Four flasks of each strain were grown according to the standard protocol. One 
culture of each was harvested 24 hours into magnesium-limited incubation, and every 24 
hours thereafter for a total of four days. One flask of each strain was also harvested 24 
hours into magnesium sufficient incubation. All cultures had glucose remaining when 
5 harvested. Fig. 6 shows graphically the AA productivity and epim erase activity in 
EMS 13-4 and ATCC 75669 as the cultures became Mg-limited. Epimerase activity in 
EMS13-4 was significantly greater than that in ATCC 75669 at all time points. There was 
also a concurrent rapid rise in both AA productivity and epimerase activity in EMS 13-4 
as the cultures became increasingly Mg-limited. While there was a moderate increase in 
10 AA productivity in ATCC 75669 as Mg became more limiting, there was no effect on 
epimerase activity. 
Example? 

The following example shows the results of epimerase assays performed with 
extracts of two K coli strains into which were cloned the K coli gene for GDP-4-keto-6- 
15 deoxy-D-mannose epimerase/reductase. 

The E. coli K12 wca gene cluster is responsible for cholanic acid production; 
wcaG encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. 

The K coli wcaG sequence (nucleotides 4 through 966 of SEQ ID NO: 3) was 
amplified by PCR from K coli W3 1 10 genomic DNA using primers WG EcoRI 5 (5' 
20 TAGAATTCAGTAAACAACGAGTTTTTATTGCTGG 3'; SEQ ID NO: 12) and WG 
Xhol 3 (5* AACTCGAGTTACCCCCAAAGCGGTCTTGATTC 3*; SEQ ID NO: 13). 
The 973-bp PCR product was ligated into the vector pPCR-Script SK(+) (Stratagene, 
LaJolla, CA). The 973-bp ExoRII/XhoI fragment was moved from this plasmid into the 
ExoRII/XhoI sites of pGEX-5X-l (Amersham Pharmacia Biotech, Piscataway, NJ), 
25 creating plasmid pSW67-l . Plasmid pGEX-5X-l is a GST gene fusion vector which adds 
a 26-kDa GST moiety onto the N-tennmal end of the protein of interest. E. coli 
BL21(DE3) was transformed with pSW67-l and pGEX-5X-l, resulting in strains 
BL21(DE3)/pSW67-l and BL21(DE3)/pGEX-5X-l. 

The£ coli wcaG sequence (nucleotides 1 through 966 of SEQ ID NO 3) was also 
30 amplified by PCR from E. coli W3 1 10 genomic DNA using primers WG EcoRI 5-2 (5' 
CTGGAGTCGAATTCATGAGTAAACAACGAG 3'; SEQ ID NO: 14) and WG PstI 3 
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(5' AACTGCAGTTACCCCCGAAAGCGGTCTTGATTC 3'; SEQ ID NO; 15). The 976- 
bp PCR product was ligated into a pPCR-Script (Srxatagene). The 976-bp ExoRlI/PstI 
fragment was moved from this plasmid into the ExoRlI/PstI sites of expression vector 
pKK223-3 (Amersham Pharmacia Biotech), creating plasmid pSW75-2. E. coli JM105 
5 was transformed with pKK223-3 and pSW75-2, resulting in strains JM 1 05/pKK223-3 and 
JM105/pSW75-2. 

All six strains were grown in duplicate at 37°C with shaking in 2X YTA medium 
until an optical density of 0.8-1.0 at 600 nm was reached (about three hours). 2X YTA 
contains 16 g/L tryptone, 10 g/L yeast extract, 5 g/L sodium chloride and 100 mg/L 

10 ampicillin. One of each culture was induced by adding isopropyl p-D- 
thiogalactopyranoside (IPTG) to 1 mM final concentration. All 12 cultures were 
incubated for an additional four hours, washed in 0.9% NaCl, and the cells were frozen 
at -80 °C. Prior to pelleting the cells for preparation of extracts, a portion of each culture 
was used for a plasmid DNA miniprep to confirm the presence of the appropriate plasmids 

15 in these strains. A protein preparation of each culture was also run on SDS gels to 
confirm expression of a protein of the appropriate size where expected. Frozen pellets 
were thawed, resuspended in 2.5 mL MOPS/EDTA buffer, pH 7.2, broken in a French 
Press (10,000 psi), spun for 20 min at 20,000 x g, assayed for protein as above and diluted 
to 0.01, 0.1, 1.0 and 3 mg/mL protein. 

20 Induction of the strain BL2 1 (DE3)/pGEX-5X- 1 resulted in high-level expression 

of a 26-kDa protein indicating the synthesis of the native GST protein. Induction of strain 
BL21(DE3ypSW67-l resulted in high-level expression of a 62-kDa protein, indicating the 
synthesis of the native GST protein (26K) fused to the wcaG gene product (36K). An 
aliquot of the fusion protein was treated with the protease Factor Xa (New England 

25 Biolabs, Beverly, MA), which cleaves near the GST/wcaG junction. Induction of the 
strain JM105/pSW75-2 resulted in high level expression of a 36-kDa protein, indicating 
the synthesis of the wcaG gene product. No such protein was detected in 
JM105/pKK223-3 (vector only). 

Next, it was of interest to test extracts in the standard epimerase assay described 

30 in Example 7 to determine if any of the extracts containing the wcaG product could bring 
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about the conversion of GDP-D-mannose to GDP-L-galactose. The extracts to be 

assayed are: 

BL2KPE3) Group 

1 . BL2 1 (DE3) uninduced 

5 2. BL21(DE3) induced with ImM IPTG 

3. BL2 1 (DE3)/pGEX-5X- 1 uninduced 

4. BL21(DE3)/pGEX-5X-l induced with ImM IPTG 

5 . BL21 (DE3)/pSW67-l uninduced 

6. BL2 1 (DE3ypS W67- 1 induced with 1 mM IPTG; fusion protein intact 

10 7. BL21(DE3)/pSW67-l induced with 1 mM IPTG; GST moiety cleaved 

JM1Q5 Group 

1 . JM1 05 uninduced 

2. JM105 induced with ImM IPTG 

3 . JM1 05/pKK223-3 uninduced 

15 4. JM105/pKK223-3 induced with 1 mM IPTG 

5. JM105/pSW75-2 uninduced 

6. JM105/pSW75-2 induced with 1 mM IPTG 

Extracts 1 and 7 from the BL21(DE3) group and extracts 1 and 6 from the JM105 
group were tested for GDP-D-nrannose: GDP-L-galactose epimerase-like activity in a pilot 

20 experiment. In this initial experiment, no epimerase activity was detected in any of the 
extracts. At this time, such a result can be attributed to a number of possibilities. First, 
it is possible that the wcaG gene product is incapable of catalyzing the conversion of 
GDP-D-mannose to GDP-L-galactose, although this conclusion can not be reached until 
several other parameters are tested. Second, it is possible that under the assay conditions 

25 which are satisfactory to measure activity for the endogenous GDP-D-mannose:GDP-L- 
galactose epimerase, the wcaG gene product does not have GDP-D-mannose:GDP-L- 
galactose epimerase-like activity. Therefore, alternate conditions should be tested. 
Additionally, confirmation experiments should be performed to confirm the accuracy of 
the pilot conditions. Third, although the BL21(DE3) and the JM 105 clones produce 

30 proteins of the expected size, the constructs have not been sequenced to confirm the 
proper coding sequence for the wcaG gene product and thereby rule out PCR or cloning 
errors which may render the wcaG gene product inactive. Fourth, the protein formed 
from the cloned sequence is full-length, but inactive, for example, due to incorrect tertiary 
structure (folding). Fifth, the gene is overexpressed, resulting in accumulation of insoluble 

35 and inactive protein products (inclusion bodies). Future experiments will attempt to 
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determine whether the constructs have or can be induced to have the ability to catalyze 
the conversion of GDP-D-mannose to GDP-L-galactose, and to use the sequences to 
isolate the endogenous GDP-D-mannose : GDP-L-galactose epimerase. 

Table 12 provides the atomic coordinates for Brookhaven Protein Data Bank 
5 Accession Code lbws: 

TABLE 12 



Bfi&QfiB EgPffiRASB/RSPPCXftSB 27-SEp-gg 1EH5 

TITLE CRYSTAL STRUCTURE OF GDP-4-KETO-6-DEOXY-D-MANNOSE 

TITLE 2 EPIMERASE/REDUC TASE FROM ESCHERICHIA COLI A KEY ENZYME IN 

10 TITLE 3 THE BIOSYNTHESIS OF GDP-L-FUCOSE 

CCMPNP MOLID: 1; 

COMPND 2 MOLECULE: GDP-4-KETO-6-DEOXY-D- MANNOSB EPIMERASE/ REDUCTASE; 

CiMEME 3 CHAJWt to 

COWPNP ENGIHEEREP; YfiSf 

15 COMPNP 5 PI9LQGICAL_UNIT; HOMQpnffiR 

SOURCE MOL_ID: 1: 

SOURCE 2 ORGANISMJ5CIENTIFIC: ESCHERICHIA COLI; 

SOURCE 3 EXPRESSION SYSTEM: ESCHERICHIA COLI 

KEYWDS EPIMERASE/REDUCTASE . GDP-L-FUCOSE BIOSYNTHESIS 

20 SXPPTA X-RAY PIPTRACTION 

AUTHOR DE M-RIZZITONETTIFLORA 

EEYBAI 1 13-JAN-99 ibws Q 

JJBH1 AIZTH DE D. RIZZITONETTIVIGEVANISTURLABISSQFLORA 

JRNL TITL GD P- 4 - KKTO- 6- DBOXYD-MANNOS E EPIMERASE/REDUCTASE 

25 iZML TTTIi 2 FROM ESCHERICHIA COLI. A KEY ENZYME IN THE 

JRNL TITL 3 BIOSYNTHESIS OF GDP-L-FUCOSE, DISPLAYS THE 

JRNL TITL 4 STR UCTURAL CHARACTERISTICS OF THE RED PROTEIN 

3B2SL TITL 5 HOMOLOGY SUPERFAMILY 

JRNL REF STRUCTURE f LONDON) U98 

30 JKNIh RE FN 9999 



EBS&rk. 1 

REMARK 2 

REMARK 2 RESOLUTION. 2.2 A NGSTROMS. 

REMARK 3 

35 RBUftRK 3 REFINEMENT- 

3 PROGRAM ; TNT 

REMARK 3 AUTHORS : TRONRUD . TEN EYCK ■ M ATTHEWS 

REMARK 3 

REMARK 3 DATA USED IN REFINEMENT. 

40 REMARK 3 RESOLUTION RANGE HIGH ( ANGSTROMS) : 2.2 
REMARK 3 RESOLUTION RANGE LOW [ANGSTROMS) ; 15, Q 
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BEMARK 


3 


DATA CUTOFF (SIGMA(F)l : 0.0 








BEMA£K_ 


3 


COMPLETENESS FOR RANGE <%) : 99.7 








BEMAR£_ 


3 


NUMBER OF REFLECTIONS : 244 Bl 








REMARK— 


3 








5 


BfiMA£E_ 


3 


USING DATA ABOVE SIGMA CUTOFF. 








REMARK 


3 


CROSS-VALIDATION METHOD : NONE 








REMARK 


3 


— FREE R VALUE TEST SET SELECTION : NULL 








REMARK 




R VALUE (WORKING + TEST SET) : NULL 








REMARK. 


3 


R VALUE (WORKING SET) : NONE 






10 


REMARK— 


3 


FREE R VALUE : NULL 








REMARK 


3 


FREE R VALUE TEST SET SIZE f%> : NONE 








REMARK. 


3 


FREE R VALUE TEST SET COUNT : NULL 








REMARK 


3 










REMARK 


3 


USING ALL DATA. NO SIGMA CUTOFF. 






15 


REMARK 




— R VALVE (working + TEST 5ETr NO CUTOFF) 


; null 






REMARK 




R YAtUE (WORKING SET, HO CUTOFF) 


; 9,202 






£EMABK_ 


3 


FREE R VALUE (NO CUTOFF) 


: 0.287 






REMARK 


3 


FREE R VALUE TEST SET SIZE (% . NO CUTOFF) 


: NULL 






REMARK 


3 


FREE R VALUE TEST SET COUNT (NO CUTOFF) 


: NULL 




20 


REMARK 


3 


TOTAL NUMBER OF REFLECTIONS (NO CUTOFF) 


t NVLL 






REMARK 


3 










REMARK 


3 


NUMBER OF NON-HYDROGEN ATOMS USRD TN RF.FTNFWRNT . 






REMARK 


3 


PROTEIN ATOMS : 2527 








EEMARK_ 


3 


NUCLEIC ACID ATOMS : NULL 






25 


REMARK 


3 


OTHER ATOMS : 109 








REMARK 


3 










REMARK 


3 


WILSON B VALUE (FROM FCALC A**2) : NULL 








REMARK 


3 










REMARK 


3 


RMS DEVIATIONS FROM IDEAL VALUES. RMS 


WEIGHT 


COUNT 


30 


REMARK 




BOND LENGTHS (A) : 0.016 ; 


NULL ; 


NULL 




REMARK 


3 


BOND ANGLES (DEGREES) : 1.65 ; 


NULL ; 


NULL 




REMARK 


3 


TORSION ANGLES (DEGREES) : NULL t 


NULL ; 


NULL 




REMARK 


3 


PSEUDOROTATION ANGLES (DEGREES) : NULL ; 


NULL ; 


NULL 




REMARK 


3 


TRIGONAL CARBON PLANES (A) : NULL z 


NVLL ; 


NULL 


35 


REMARK 


3 


GENERAL PLANES (A) : NULL ; 


NULL f 


NVLL 




£EMA£E__ 


3 


ISOTROPIC THERMAL FACTORS (A**2) : NULL ; 


NVLL f 


NVLL 




£fiMA£K_ 


3 


NON-BOWDEp CONTACTS (A) : NULL ; 


NULL ; 


NULL 




REMARK 


3 










REMARK 


3 


INCORRECT CHIRAL CKNTERS (COUNT) ; NULL 






40 


REMARK 


3 










REMARK 


3 


BULK SOLVENT MODELING. 








REMARK 


3 


METHOD USED : NULL 








REMARK 


3 


KSOL : NULL 








REMARK 


3 


BSOL : NULL 






45 


REMARK— 


3 
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REMARK 3 RESTRAINT LIBRARIES. 

REMARK 3 STEREOC HEMISTRY ; NULL 

REMARK 3 ISOTRO PIC THERMAL FACTOR RESTRAINTS : NULL 

BOOBE 2 

5 REMARK 3 OTHER REFINEMENT REMARKS; NULL 

REMARK. i 

REMARK 4 1BW5 COMPL IES WITH FORMAT V. 2,2. 16-DEC-1996 

REMARK. 5 

REMARK 5 WARNING 

10 REMARK 5 1BWS: THIS IS LAYER 1 RELEASE, 

REMARK 5 

REMARK 5 PLEASE NOTE THAT THIS ENTR Y WAS RELEASED AFTER DEPOSITOR 

REMARK 5 CHECKING AND APPRO VAL BUT WITHOUT PDB STAFF INTERVENTION. 

REMARK 5 AN AUXILIARY FILE, AUX1BWS.RPT. IS AVAILABLE FROM THE 

15 REMARK 5 PDB FTP SERVER AND I S ACCESSIBLE THROUGH THE 3DB BROWSER. 

REMARK 5 THE FILE CONTAINS THE OU TPUT OF THE PROGRAM WHAT CHECK AND 

REMARK. 5 OTHER DIAGNOSTICS, 

REMARK 5 

REMARK 5 NOMENCLATURE IN THIS ENTRY. INCLUDING HST RESID U E NAMES 

20 REMARK 5 AND HET AT OM NAMES. HAS NOT BEEN STANDARDIZED BY THE PDB 

REMARK 5 PROCESSING ST AFF. A LAYER 2 ENTRY WILL BE RELEASED SHORTLY 

REMARK 5 AFTER THIS STAN DARDIZATION IS COMPLETED AND APPROVED BY THE 

REMARK 5 DEPOSITOR. THE LAYER 2 ENTR Y WILL BE TREATED AS A 

REMARK 5 CORRECTION TO THIS ONE. WITH THE APPRO PRIATE REVDAT RECORD. 

25 REMARK. 5 . 

REMARK 5 FURTHER INFORMATION INCLUDING VALIDATION CRITERIA USED IN 

REMARK 5 CHECKING T HTS ENTRY AND A LIST OF MANDATORY DATA FIELDS 

REMARK 5 ARE AVAILABLE FROM THE PDB WEB SITE AT 

REMARK 5 http://WWW.PDB.BNL.GOV/, 

30 REMARK 200 

REMARK 2PQ EXPERIMENTAL pETAILS 

REMARK 200 EXPERIMENT TYPE ; X . -RAY PIFFRACTION 

REMARK 200 DATE OF DATA COLLECTION I AVg-I9ft7 

REMARK 200 TEMPERATURE (KELVIN) ? 12 P 

35 REMARK 200 PH 

REMARK 200 NUMBER OF CRYSTALS USED i_l 

RgtfARK 2QQ 

REMARK ZOO SYNCHROTRON (Y/N) ? N 

REMARK 200 RADIATION SOURCE i NONE 

40 REMARK 200 BEAMLINE i NULL 

REMARK 200 X-RAY GENE RATOR MODEL ? RISftKU RU20Q 

REMARK 200 MONOCHROMATIC OR LAUE CM/D ; M 

REMARK 200 WAVELENGTH OR RANGE (A) ; 1,5418 

REMARK 200 MONOCHROMATOR 1 — NULL 

45 REMARK gOQ OPTICS : HULL 
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REMARK 


2Q0 


DETECTOR TYPE 


: IMAGE PLATE 




REMARK 


290 


DETECTOR MANUFACTURER 


: RAXIS 




RQJARK 


299 


INTENSITY- INTEGRATION SOFTWARE 


: MOSFLM 


5 


REMARK 


299 


DATA SCALING SOFTWARE 


: SCALA 




REMARK 


290 








REMARK 


2QQ 


NUMBER OF UNIOUE REFLECTIONS 


: 244B1 




REMARK 


299 


RESOLUTION RANGE HIGH (A) 


: 2.2 




REMARK 


290 


RESOLUTION RANGE LOW (A) 


: 15.0 


10 


REMARK 


299 


REJECTION CRITERIA (SIGMA fll ) 


: NONE 




REMARK 


299 








REMARK 


200 








REMARK. 


299 


COMPLETENESS FOR RANGE (%) 


: 99.7 




REMARK, 


290 


DATA REDUNDANCY 


: 4.3 


15 


REMARK 


299 


R MERGE (I) 


: 0.057 




REMARK 


299 


R SYM (I) 


: NONE 




REMARK. 


290 


<I/SIGMA(I)> FOR THE DATA SET 


: 13. 6 




REMARK. 


299 








REMARK 


200 


IN THE HIGHEST RESOLUTION SHELL. 




20 


REMARK 200 


HIGHEST RESOLUTION SHELL, RANGE 


HIGH (A) : NULL 




RfiMARK. 


200 


HIGHEST RESOLUTION SHELL. RANGE 


LOW (A) : NULL 




REMARK- 


299 


COMPLETENESS FOR SHELL (%) 


: NULL 




REMARK 


200 


PATA REDUNDANCY Iff SHELL, 


: NULL 




REMARK 200 


R MERGE FOR SHELL (I) 


: NULL 


25 


REMARK. 


299 


R SYM FOR SHELL (I) 


: NULL 




REMARK. 


299 


<I/SIGMA(I1> FOR SHELL 


: NULL 




REMARK. 


200 








REMARK 


290 


DIFFRACTION PROTOCOL: NULL 






REMARK 200 METHOD USED TO DETERMINE THE STRUCTURE: MIR 


30 


REMARK 


299 


SOFTWARE V5EP; null. 






REMARK- 


299 


STARTING MOPBI-; WILL 






REMARK 


290 








REMARK- 


209 


REMARK: NULL 






REMARK 


290 






35 


REMARK- 


2?o 


CRYSTAL 






REMARK- 


299 


SOLVENT CONTENT. VS {%): NULL 






REMARK- 


2B0 MATTHEWS COEFFICIENT. VM f ANGSTROMS**.! /DA) : NULL 




REMARK 


299 








REMARK- 


299 


QRYSTAMiatflON CONDITIONS; NULL 




40 


REMARK- 


299 








REMARK- 


299 


CRYSTALIOGRAPHIC SYMMETRY 






REMARK- 


2?o 


SYMMETRY OPERATORS FOR SPACE GROUP: P 32 2 1 




REMARK- 


299 








REMARK- 


290 


SYMOP SYMMETRY 




45 


REMARK- 


299 


NNNMMM OPERATOR 
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10 



nnrrl" 
REMARK 290 


2555 


A, I, 2. 

-Y,X~Y.Z+2/3 


REMARK 29Q 


3555 


Y-X,-X.Z+l/3 


REMARK 290 


4555 


Y,X.-Z 


REMARK 299 


5555 


X-Y.-Y.1/3-Z 


REMARK 290 


6555 


-X.Y-X.2/3-Z 


REMARK 290 


REMARK 290 


WHERE NNN -> OPERATOR NUMBER 


REMARK 290 


MMM -> TRANSLATION VECTOR 


REMARK 290 


REMARK 290 


rRYSTAVhOtfRAPHyr; SYMMETRY TRANSFORMATIONS 



REMARK 


290 


RECORDS IN 


THIS ENTRY TO PRODUCE 


CRYSTALLOGRAPHICALLY 


REMARK 290 


RELATED MOLECULES. 








REMARK 


290 


5MTRY1 


1 


i. 999999 


0.000000 


9.999990 


o t 9999P 


REMARK 


290 


5MTRY2 


1 


9.999999 


It999999 


0 .000000 


9rP990p 


REMARK 


299 


SMTRY3 


% 


0-999999 


0. 000000 


1,999999 


0. ooooo 


REMARK. 


290 


SMTRY1 


z 


-0-500045 


-0.865974 


0.000000 


0.00000 


REMARK 


290 


SMTRY2 


2 


0.866077 


-0.499955 


0.000000 


0.00000 


REMARK 


290 


SMTRY3 


2 


0.000000 


0.000000 


1.999999 


50 . 58553 


REMARK 


299 


SMTRY1 


? 


-9.199955 


o f ?65974 


0.000000 


9.99990 


REMARK. 


299 


5MTRI2 


3 


-9. 966977 


-o T 599945 


9.999P99 


9.99999 


REMARK 


299 


SMTRY? 


3 


9.999999 


o t oppp99 


1.999099 




REMARK. 


299 


SMTRY1 


4 


-0 r 500p45 


9.96592? 


0,999999 


Q.*PP.Q0P 


REMARK 


299 


SMTRY2 


4 


0.866077 


0.500045 


0.000000 


n nnnnn 
u . uuuuu 


REMARK. 


2?0 


SMTRY3 


4 


0.000000 


0.000000 


-1.000000 


0 . 00000 


REMARK. 


290 


SMTRY 1 


5 


1.000000 


0.000104 


0.000000 


0.00000 


REMARK 


290 


SMTRY2 


5 


0.000000 


-1.000000 


0.000000 


0.00000 


REMARK 


290 


5MXRY3 


5 


0.000000 


0.000000 


-1.999999 


25-29276 


REMARK 


290 


smtry; 


6 


-9.499955 


-0.866026 


0.999990 


0.00000 


REMARK 290 


SMTRY? 


6 


-9.966977 


0,499955 


0.000000 


9,99099 


REMARK 


2.90 


SMTRY3 


6 


0-999999 


0.000000 


-1,999999 


50.58553 


REMARK. 


299 














REMARK. 


299 


REMARK: NULL 










REMARK- 


165 














REMARK- 


465 


MISSING RESIDUES 








REMARK 


465 


THE FOLLOWING 


RESIDUES 


WERE NOT LOCATED IN THE 




REMARK. 


465 


EXPERIMENT t 


(MrMQDEL NVMBER,- RES= 


RBSIPVP NAME; 


C=CHAIN 


REMARK- 


465 


IDENTIFIER; 


SSSEO=SEOUENCE NUMBER 


; I=INSERTION 


CODE* : 


REMARK. 


165 














REMARK 


165 


M RES C SSSEOI 








REMARK 


465 


MET A 




1 








REMARK 


MS 


SER A 




2 








REMARK 


165 


ASP A 


317 








REMARK. 


165 


ARG A 


318 









15 



20 



25 



30 



35 



40 



45 
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remark. 


465 


PHE 


A 319 




REMARK. 


465 


ARC A 320 




BEUtfK 


465 


GLY 


A 321 




BEHARK 


800 






5 


REMARK 


800 SITE 






REMARK 900 SITE IDENTIFIER: CAT 




REMARK 


800 SITE DESCRIPTION: 




REMARK. 


800 CATALYTIC RESIDUE 




remark 


800 






10 


REMARK 


800 SITE IDENTIFIER: CAT 




REMARK 800 SITE DESCRIPTION: 




REMARK 


800 CATALYTIC RESIDUE 




REMARK 


800 








REMARK 


800 SITE IDENTIFIER: CAT 


15 


REMARK 


8QQ SiTEjEgCRIPTION; 




REMARK 


800 CATALYTIC RESIDUE 




RBHARK 
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HETATM 55 O HOH 67 42.746 25.153 23.465 1.00 27.05 2 

HETATM 56 O HQH 68 53.638 -16.457 32.292 1-00 31.71 Q 

HETATM 57 O HOH 69 33.390 41.716 31.408 l.QQ 29,92 Q 

HETATM 58 O HOH 70 5 7.768 17.B97 42.434 1 . 00 25.75 Q 

5 HETATM 59 O HOH 71 75.647 9,164, 11*1$$ 1,00 35,13 2 

HETATM 60 O HOjJ 22 62.032 33.292 44.749 1,00 46,19 2 

hetatm 61 o HQH 12 47.31Q 14,312 31.295 1,QQ 31,19 Q 

HETATM 62 O HOH 74 79.660 -3.947 15.913 1.00 34.63 2 

HETATM 63 O HOH 75 46.929 5.343 4.55Q 1 T QQ 23,14 2 

10 HETATM 64 O HOH 76 73.475 12 .039 28.412 l.QQ 27,26 Q 

HETATM 65 o HQH 7J 46,297 -6,992 30.032 l.QQ 43,41 2 

HETATM 66 Q HOH 78 68.528 -3.422 40.869 1 T OP 38-47 2 

HETATM 67 Q HOH 79 62.080 -1.448 42.903 l.QQ 24.60 Q 

HETATM 68 O HOH 80 65.330 18,150 40.726 1.00 41, QQ 2 

15 HETATM 69 O HOH 81 51.775 16.128 37.607 1-00 25, U 2 

HETATM 70 O HOH 83 54.266 28.682 43.313 1.00 27.61 2 

HETATM 71 O HOH 85 7 3.291 -15.479 20.603 1.00 37.54 2 

hetatm 72 o hoh 8 6 34,76Q 21.479 29,544 1,QQ 43.97 p_ 

HETATM 73 O HOH 87 37.326 24.131 29.677 l.QQ 24.47 2 

20 HETATM 74 O HOH 86 65.168 20.148 6.735 l t QP 26,10 O 

HETATM 75 O HOH 89 59.196 12.089 13.630 1.00 25.24 Q 

HETATM 76 O HOH 91 66.576 -6.235 40,279 1.00 43.11 2 

hetatm 77 o hoh 93 37.339 29.394 25x515 l.QQ 27.56 2 

HETATM 78 O HOH 94 52.339 -17.014 42.271 1.00 48.96 O 

25 HETATM 79 O HOH 95 40.511 32. 927 31.717 1,QQ 22.46 2 

HETATM 80 O HOH 96 78.580 13.121 34.138 1.00 27.98 2 

HETATM 81 O HOH 97 65.090 15.704 34.876 l.QQ 18.96 Q 

HETATM 82 O HOH 99 84.562 2,951 27.191 It QQ 35-92 2 

HETATM 83 O HOH 1Q0 5Q.3B6 9,761 9,646 \ . QQ 23,18 Q 

30 HETATM 84 O HOH 101 67.649 -0. 851 38.764 1-QQ 2*. 99 2 

HETATM 85 O HOH 102 44.001 4-293 34.315 1-OQ 31.13 O 

hetatm 86 o hoh 103 59.396 -5,071 26.211 1-OQ 29.10 o 

HETATM 87 Q HOH 104 77.364 4.745 41.506 1.00 35.32 2 

HETATM 88 Q HOH 105 59.034 21.201 32.414 1,QQ 23-4? 2 

35 HETATM 89 O HOH 106 42.463 34.698 14.327 l.QQ 39-96 2 

hetatm 90 o hoh 107 ?P,217 14i292 20.964 — l.QQ 42.39 2 

hktatm 91 o hoh lflfi 76.999 9.130 25.962 l.QQ 32.91 2 

HETATM 92 Q HOH 109 49.766 29.937 22.173 1.00 42.52 Q 

HETATM 93 Q HQH 110 72.473 13. 536 38.823 1,00 33,32 O 

40 HETATM 94 O HQH 111 64.328 -12.084 38.609 1 T Q0 37.99 2 

HETATM 95 O HOH 112 60.161 16.382 42.682 1.00 35.68 Q 

HETATM 96 Q HOH 113 47.602 13. 639 27.016 1.00 26,01 2 

hetatm 97 Q HOH 115 64.606 11.644 40.107 1.00 30.33 2 

HETATM 98 Q HOH 116 61.231 -15, 137 27.255 l-OQ 39.76 2 

45 HETATM 99 O HOH 117 65.324 -11.223 35.09B 1,00 30-45 2 
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HETATM 1QQ 9 HQH 119 56.602 17.219 44.932 1.00 36.53 2 

HETATM 101 9 HQH 120 37.564 19.8 60 23.135 1.00 31.27 O 

HETATM 102 O HQH 121 6 1,9 4 5 5.057 21.132 l.OO 45.57 O 

HEE&2M 103 O HQH 123 63.391 16.801 26.898 1.00 39.46 Q 

5 HEXftHi 1Q4 9 HQH 12J 42.567 6. 134 32.635 1.00 31.56 Q 

HETAIM 1Q5 9 HQfl 125 72.485 13.236 35.059 1.00 29.61 O 

HETATM 1P6 9 HQH 12$ 55,22? 3.650 44.032 l.OO 36. B6 o 

HETAIM 1Q7 Q HQH IZ1 37.9S? 7.148 31.0B3 1.00 39. SB Q 

HETATM 108 O HQH 128 73.327 10.546 12.123 1.00 34.97 O 

10 HETATM 109 O HQH 129 74.450 10.299 26.598 1.00 30.80 Q 

HETATM 110 AOS* HDP A 1 67.524 13.055 26.692 1.00 36.42 Q 

HETATM. 111 A.C5* NPP A 1 68 .99 9 12.297 25.614 1.00 9.30 £ 

HETATM 112 AC4* NDP A 1 69.601 12.124 25.858 1.00 27.73 £ 

HETATM 113 AQ4* HDP A 1 70.193 11.258 24.848 1.00 22.87 O 

15 HETAIM 111 AC3* NPP A 1 7Q.484 13.390 25.873 1.00 17.83 £ 

HETATM 115 AQ?» W p A 1 71.192 13.436 27.066 1.00 16.11 Q 

HETATM 116 AC2* HDP A 1 71.373 13.220 24.626 1.00 11.46 £ 

HETATM 117 A02* NPP A 1 72.623 13.886 24.655 1.00 31.96 Q 

HETATM 118 ACl* HDP A 1 71.510 11.702 24.656 1.00 19.02 C 

20 HETATM 119 Q3 NPP A 1 65.336 13.590 26.129 1.00 20.59 0 

HETATM 120 WQ5* NDP A 1 63.536 11.943 26.44 8 1.00 28.99 O 

HETATM 121 NC5* NPP A 1 64.328 10.843 25.95 7 1.00 24.89 £ 

HETATM 122 K£4* HDP A 1 63.467 9.646 25.686 1.00 31.79 £ 

HETATM 123 NQ4* flDP A 1 6 2.937 9.337 26.90B 1.00 28.82 Q 

25 HETATM 124 HC3* NDP A 1 62.340 9.837 24.665 1.00 11.50 C 

HETATM 125 N93* NPP A 1 62.891 9.402 23.461 1.00 28.60 Q 

HETATM 126 NC3* NPP A 1 61.152 8.996 25.138 1.00 28.11 c 

HSTATM 127 NQ2* HDP A 1 60.881 7.662 24.715 1.00 24.30 O 

HETATM 129 N91 + NPP A 1 61,547 8.875 26.580 l.OO 35.35 C 

30 HETATM 1?9 AF2* NPP A 1 73.104 15.069 23.823 1.00 32.96 £ 

HETATM 13Q A9P1 NPP A 1 74.500 15.308 24. 308 1.00 37.84 Q 

HETATM 131 AOP2 NDP A 1 72.797 14.925 22.348 1.00 36.66 Q 

HETATM 132 AOP3 HDP A 1 72.163 16.217 23.958 1.00 31.97 Q 

HETATM 133 AP NPP A 1 66.660 14.257 26.393 1.00 26.17 XX 

35 HETATM 134 AQl NDP A 1 66.886 14.795 25.047 1.00 15.31 X£ 

HETATM 135 A92 NPP A 1 66.439 15.207 27.521 1 .00 34.39 XX 

HETATM 136 AN9 NDP A 1 71.820 11.224 23.353 1.00 13.63 X£ 

HETATM 137 ACS NDP A 1 71.104 11.316 22.200 1.00 12.41 X£ 

HETATM 139 AN7 NPP A 1 71.758 10.835 21.161 1.00 15.71 X£ 

40 HETATM 13? acs NPP A 1 72.933 1Q T 313 21 t 7lQ j.oo 16,17 xjc 

HETATM 14 0 AC6 NDP A 1 74.053 9.657 21.140 1.00 31.35 JOC 

HETATM 141 AW6 NPP A 1 71,165 9,464 19.819 1.00 12.59 X£ 

HETATM 142 AN1 NPP A 1 75.078 9.280 21.942 1.00 17.56 X£ 

HETATM 143 AC 2 NDP A 1 74.971 9.578 23.251 1.00 15.44 X* 

45 HETATM 144 AN3 NDP A 1 74.027 10.302 23.889 1.00 24. B? XJC 
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HETATM 145 AC4 NDP A 1 73.036 10. 653 23.047 1.00 17.48 

HETATM 146 NF MPP A 1 €4.193 13,1Q6 27,191 l.QQ 25,47 M 

HETATM 117 NCI WPP A 1 $3,142 14.169 27.253 1.00 28.69 

HETATM 148 NQ2 NDP A 1 64.837 12.643 28.49 2 1.00 24.32 N_ 

5 HETATM 149 NNl NDP A 1 SP,59B 9.775 27.109 1.00 23.63 

HETATM 150 NC2 HDP A 1 60.143 10.905 26.442-99.00 78.36 H 

BEXARi 151 NC3 M DP A 1 59.070 11.648 27.007-99.00100.00 H 

HEXATM 152 NC7 NPP A 1 58.497 13.017 26.528-99.00100.00 H 

HETATM 153 NQ7 nop ft 1 59,359 13,793 25,972-99,00100,00 H 

0 HETATM 154 NN7 NDP A 1 57.207 13.400 26.912-99.00 84.38 M 

HETATM 155 NC4 NDP A 1 58.442 11.146 28.137-99.00100.00 H 

HETATM 156 NC5 NDP A 1 58.912 9.963 28.754-99.00100.00 H 

HETATM 157 NC6 NDP A 1 59.951 9.266 28.147-99.00100.00 H 

Aiaj 159 N LY5 A 3 76.227 -5.632 44.315 1.00 61.49 N_ 

5 ATOM 159 CA LYS A 2 76.152 -4.302 43.684 1.00 58.00 £ 

A1SW ;60 C L.YS A 3 75.985 -4,421 42 rlU l.QQ 52,79 £ 

AI2i l£t Q LYS A 3 76,921 -4,737 41.419 l.QP 44,76 Q 

AI£M 162 CB LYS A 3 77.359 -3.417 44.030 1.00 59.74 £ 

AISM 163 CG LYS A 3 77.011 -1.944 44.314 1.00 50.87 £ 

0 AXOl 164 CP LYS ft 2 78.208 -1.161 44.894 1.00 61.21 £ 

AXOI 165 CE LYS A 3 77.855 -0.377 46.186 1.00100.00 £ 

ATJ2M 166 MS LYS A 3 78.857 -0.401 47.343 1.00 70.61 

AXOS 167 W SUL-h 4 74.746 -4.242 41.747 1.00 45.15 H 

AXOI 16£ CA CItH A 4 74.408 -4.326 40.347 1.00 37.18 £ 

5 AlfiM 169 C GLN A 4 74.983 -3.166 39.561 1.00 34.93 C 

AlfiM 12fi 0 GLN A 4 75.127 -2.050 40.087 1.00 28.48 Q 

AIQM 171 CP <?LN A 4 72.915 -4.445 40.221 1.00 34.65 £ 

ATOM 122 CP GLN A 4 72.456 -5.854 40.584 1.00 31.82 C 

ATSE 173 CP GLN A 4 72,570 -6,798 39 T 405 lt pp 79,25 £ 

3 AlflM 174 QB1 CLN a 4 72,165 -6,452 39.286 1,PP1PQ.00 Q 

AX21 175 WE2 CM* A 4 73.206 -7.925 39.623 1.00 80.24 N. 

AISH 176. U ARG a 5 75,475 -3.495 39.375 1,00 27,16 M 

A12M 177 CA ARC A 5 76.146 -2.546 37.483 1.00 39.16 £ 

Aiai 178 C ARG A 5 75.191 -2.018 36.433 1.00 38.22 £ 

5 ATOJ 17? Q ARG A 5 74.93B -2.698 35.438 1.00 32.44 Q 

AXSM 18P C B A RG A 5 77.39B -3.163 36.826 1.00 41.76 £ 

ATCH 191 CG ARG A 5 78.692 -2.954 37.663 1.00 37.34 £ 

Aiaj 192 CP ARC A 5 9Q.P1S -3,236 36.976 i.po 32,99 £ 

ATCM 183 NE ARG A 5 81.036 -2.203 37.125 1.00 25.71 £f 

^ ATSM 184 CZ ARG A 5. 81.617 -1.488 36.169 1.00 32.53 £ 

AISM 185 NH1 ARG A 5 81.293 -1.704 34.904 1.00 40.07 N 

AtfiM 196 HH2 ARC A 5 82.516 -0.551 36.474 1.00100.00 N_ 

At£H 197 n VAL A 6 74.743 -p.773 36.65? j,po 32.0s n_ 

ATiM 188 CA VAL A £ 73.715 -0. 082 35.881 1.00 28.89 £ 

5 ATOM 189 C VAL A 6 74.161 1. 021 34.897 1.00 29.37 C 
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ATCH 190 O VAL A 6 74.745 2.041 35.274 1.00 22.50 O 

ATOM 191 CB VAL A 6 72.577 9,378 36.813 1,00 23,52 £ 

ATOM 192 CGI VAL A 6 71.366 9.969 36.906 1,99 20,29 £ 

ATCH 193 CG2 VAL A 6 72.108 -0.852 37,644 1.00 18.45 £ 

5 ATCM 194 H PHE A 7 73.948 9,749 33, 615 1,90 22.92 H 

ATCM 195 CA PHE A 7 74.267 1.710 32.573 1.00 27.15 £ 

ATCM 196 C PHE A 7 72.975 2.423 32.192 1.00 20.24 £ 

ATCM 197 O PHE A 7 71.994 1,799 31.815 1,99 29,71 Q 

ATCM 198 CB PHE A 7 74.864 1.004 31.374 1.00 18.98 £ 

10 atcm 199 cg phe A 7 74,916 1,936 39. 115 1,99 21,93 £ 

ATCM 200 CD1 PHE A 7 75.521 3 .087 30.108 1.00 19.36 C 

ATOM 201 CD2 PHE A 7 74.4B3 1.284 28.6 86 1.00 23.50 £ 

ATCM 202 CE1 PHE A 7 75.614 3.828 28.902 1.00 27.52 £ 

ATCM 203 CE2 PHE A 7 74.54B 1.996 27.685 1,99 19-33 £ 

15 atcm 204 cz phe a 7 75.128 3.255 27,673 1,99 19.59 £ 

ATCM 205 H ILEA 8 72.959 3 t 727 32,454. 1,99 19,75 U 

ATCM 206 CA ILE A 8 71,844 4,599 32rll2 1-00 H,25 £ 

ATCM 207 C ILEA 8 72.337 5.351 30.909 1.00 11.22 £ 

ATCM 208 O ILEA 8 73.259 6.165 30.998 1.00 17.76 O 

20 ATCM 209 CB ILE A 8 71.507 5.605 33.212 1.00 14.15 £ 

ATOM 210 CGI ILE A 8 71.356 4.949 34.582 1.00 8.24 £ 

AISH 2U CG2 I LB A g 79.193 6.34,2 32,974 1,99 16,95 £ 

ATC« 212 cpl ire a 8 7;. 09; 5,961 ?5 T 707 1,90 1Q.32 £ 

atom 213 w ALA a 9 71.996 4.996 29.752 1.99 16.42 H 

25 ATCM 214 CA ALA A 9 72.256 5,559 28,513 1.00 18,74 £ 

ATCM 215 C ALA A 9 71.530 6.913 28.511 1,00 28.45 £ 

ATCM 216 O ALA A 9 70.411 7.93? 29.945 1-99 22.39 O 

ATCM 217 CB ALA A 9 71.808 4.731 27.311 1.00 14.43 £ 

ATCM 218 W GLY A 10 72.199 7.922 27.949 1-00 20.96 U 

30 ATOM 219 CA GLY A 10 71.706 9.284 27.911 1,99 19.62 £ 

AEM 2 29 c GLY A IP 71.497 9,919 29.395 l.oo 16-49 £ 

ATCM 221 O GLY A 10 70.379 10.448 29.481 1.00 17 T 36 O 

ATCM 222 N HIS A 11 72.295 9.581 30.272 1.00 10.32 K 

ATCM 223 CA HIS A 11 72.068 9 .966 31.688 1.00 13.90 £ 

35 ATCM 224 C HIS A 11 72.008 11.504 31.916 1.00 21.52 £ 

ATCM 225 O HIS A 11 71.700 11.994 32.983 1.00 13.22 Q 

ATCM 226 CB HIS A 11 73.153 9.350 32.581 1.00 14.88 £ 

ATCM 227 CG HIS A 11 74.592 9.949 32.326, 1,90 23,73 £ 

ATCM 228 Ml HIS A 11 75.239 9,649 31.197 1-09 24,99 H 

40 AXSM 229 CD2 his a li 75.167 19,952 32.956 1,99 16-35 £ 

ATCM 230 CE1 HIS A 11 76.317 10.407 31.170 1.00 22.54 £ 

ATCH 231 NE2 HIS A 11 79.271 11.249 32,197 1.99 17.56 U 

ATCM 232 H ARG A 12 72.310 12.288 39.908 1.00 22.31 

ATCM 233 CA ARG A 12 7 2-147 13.693 31.122 1.00 18.90 C 

45 ATCM 234 C ARG A 12 70.851 14. 244 30.495 1.00 26.34 £ 
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Aiai 215 £2 ARG A 12 70.572 15.426 30.604 1.00 25.37 Q 

MSM 236 CB ARG a 12 73,352 14.418 30.587 l.oo 25.93 £ 

&XSM 237 CG ARG ft U 74.582 13.943 31.279 1.00 53.87 £ 

AlOl 239 CP ARG A U 75.757 14.619 30.699 1.00 32.53 £ 

5 AXSM 239 NE ARG A 12 76.359 15. 576 31.605 1.00 69.90 

24Q CZ ARG A 12 76.971 16.675 31.17B 1.00100.00 £ 

AXGM 241 NH1 ARG A 12 77.001 16.948 29.867 1.00100.00 H 

AXfiM 242 NH2 ARG A 12 77.526 17.508 32.056 1.00100.00 

Aiai 243 W GLY A 13 70.078 13.420 29.800 1.00 IB. ?5 jf 

10 Aiai 241 CA gLY A 13 68.802 13.904 29.258 l.QQ 16.50 £ 

ATOM 245 C GLY A 13 67.849 1 4.144 30.428 1.00 18.88 £ 

AXCM 246 9 GLY A 13 68.202 13.902 31.624 1.00 14.04 Q 

AJSH 247 K MET A 14 66.653 14.632 30.103 1.00 16.00 g 

ATq< 21S CA MET A 14 65.688 14.981 31.128 1.00 13.49 C 

15 A3SM 24? C MET A 14 65.293 13.760 3 1.901 1.00 14.0? £ 

Aiai 259 0 MET A 11 $5 t 4QB 13.713 33.145 1.00 17.06 O 

M31 251 CB MET A 14 64.442 15. 605 30.524 1.00 11.57 £ 

AI2M 252 CG MET A 14 63.320 15.628 31.559 1.00 20.77 £ 

Mai 253 SD MET A 14 61.926 16.766 31.110 1.00 29.16 £ 

20 ASSM 2_5_4 CE MET A 14 62.527 17.108 29.574 1.00 30.68 £ 

ATttl 255 N VAL A 15 64.798 12.769 31.158 1.00 25.23 H 

AX2ta 256 CA VAL A 15 64.439 11.468 31.738 1.00 20.90 £ 

MOl 257 C VAL A 15 65.654 10.713 32.378 1.00 17.26 £ 

MSB 259 <? VAL A 15 65.590 10. 239 33.524 1.00 18.41 Q 

25 filOf 259 CB VAL A 15 63.752 10.550 30.680 1.00 23.25 C 

AISM 26? CGI VAL A 15 63.330 9.253 31.310 1.00 15.71 £ 

aTCM 261 CG2 VAT, A IS 62.528 11.193 3O.IB3 l.QQ 13.40 £ 

&Xai 1£2 H SLY A 16 66.784 10.642 31.665 1.00 20.39 H 

A331 263 CA SLY A 16 67 , 9 41 9.904 32.186 1.00 19.54 £ 

30 AXO! 264 C GLY A 16 68.522 10.432 33.492 1.00 29.29 £ 

Ala! 265 O gLY A 16 68.896 9.659 34.434 1.00 16.91 Q 

MS! 266 K 5 BR A 17 68.642 11.755 33.499 1.00 12.53 H 

MS3M 267 CA SER A 17 69.154 12.460 34.650 1.00 21.93 £ 

MCH 268 £ 9 BR A 17 68.209 12.214 35.818 1.00 13.35 £ 

35 Mai 269 Q SEP A 17 68.677 11.957 36.915 1.00 24.19 Q 

Mai 270 CB SER A 17 69.378 13.942 34.333 1.00 15.52 £ 

Mai 271 W SEP A 17 68.153 14.619 34.372 1.00 22.95 Q 

Mai 272 H ALA A 18 66.896 12. 143 35.590 1.00 17.52 U 

Aiai 273 CA ALA A 19 65.991 11.828 36.729 1.00 13.14 £ 

40 Mai 274 C ALA A 19 66.220 10.393 37.307 1.00 19.29 C 

Mai Q ALA A 18 66.149 10.150 38.522 1.00 16.94 O 

Mai 276 CB ALA A 18 64 t 46Q 12.046 36.334 1.00 14.33 £ 

Mai 277 W ILE A 19 66,4 94 9.432 36.430 1.00 20.80 H 

Mai 278 CA ILEA 19 66.705 8.078 36.900 1.00 18.08 £ 

45 Mai 279 C ILE A 13 67.975 8.090 37.730 1.00 16.09 C 
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ATCM. 280 O ILEA 19 68.01B 7.530 38.820 1.00 20.73 fi 

ATCM 2&1 CS ILE A 1? 66.8Q4 7.079 35.710 1.00 17.58 £ 

aiSM 282 CGI ILE A 19 65,114 6.812 35.162 1.00 10.09 C 

AICM 2£3 CG2 ILB A 1? 67.309 5.666 36.133 1.00 21.60 £ 

5 ATOM 284 CD1 ILE A 19 65,529 6.361 33.7 41 1.00 19.05 £ 

AXfiU 285 N ARG A 20 68.984 8.771 37.198 1.00 18.13 H 

AI2M 286 CA ARG A 20 7Q, 296 8.897 37. 836 1.00 20.25 £ 

AXSM 297 £ ARG A 20 70,231 9.491 39.242 1.00 30.62 £ 

AISM 2SJB Q ARG A 20 70.957 9.091 40.129 1.00 33.00 Q 

10 Aiai 289 CB ARG A 20 71.201 9.743 36.957 1.00 11.71 £ 

AXSM 2?Q gg ARG A 29 7 2,61 0 9. 781 37. 449 1.00 23.79 C 

ATCH 29J CD ARG A 20 72.881 11.107 38.060 1.00 36.76 £ 

Aiai m WE ARG A 20 74.297 11. 443 38.062 1.00 48.34 U 

ATOM 29J CZ ARG A 20 74.990 11.841 36.988 1.00100.00 £ 

15 AX£M. 294 N B1 A RG A 20 74.393 11.931 35.808 1.00100.00 H 

AISM 295 NH2 ARG A 20 76.289 12.139 37.076 1.00100.00 1* 

ATOM 226 U ARG A 21 69,368 10.461 39 .439 1.00 22.10 H 

AlOl 297 CA ARG A 21 69.216 11. 052 40.750 1.00 17.45 £ 

ATCM. 29J £ ARG A 21 68.721 10.007 41.73 0 1.00 26.71 £ 

20 ATCM 299 O ARG A 21 69.147 10. 001 42.885 1.00 30.27 Q 

Aiai 300 £p ARG A 21 $8,14,2 12.144 40. 708 1.00 17.93 £ 

ATOM 301 «r ARG A 21 68,692 1? T 522 40.321 1.00 27.57 £ 

ATCM 302 CP ARG A 21 67.586 14.599 40.13 0 1.00 23.02 £ 

ATOM 303 WB ARG A 21 67.619 15.000 38.743 1.00 55.12 H 

25 ATCM 30^ CZ ARG A 21 66.538 15.103 37.99 5 1.00 10.55 £ 

ATCM 305 NH1 ARG A 21 65.343 14.974 38.552 1.00 29.80 H 

AICM 306 WH2 ARG A 21 66.665 15. 435 36.715 1.00 61.45 N. 

Aiai 307 N GLN A 22 67,713 9.223 41. 345 1.00 27.48 £ 

ATOM 309 OA Gfctf a 2g 67,167 8.257 42,313 i t QQ 2 4,79 c 

30 Aiai 309 C GLN A 22 68.137 7.127 42.547 1.00 31.37 C 

ATOM 310 P GLJf A 22 69,391 6.724 43.685 1.00 27.47 Q 

Aiai 311 CP GLW A 22 65,918 7,706 4,1,991 ItQQ 17,11 £ 

AX01 3_12 CG GLN A 22 64.921 8.745 41.243 1.00 66.14 £ 

ATCM 313 CD G LN A 22 63.425 8.456 41.397 1.00 41.27 C 

35 ATCM 314 OE1 GLN A 22 63.002 7.329 41.762 1.00 29.34 O 

ATCM 315 NE2 GLN A 22 62.610 9.464 41.046 1.00 20.12 H 

ATCM 316 W LEP A 23 68.697 6.652 41.448 1.00 27.99 £ 

ATCM 317 CA LEU A 23 69.649 5.575 41.500 1.00 24.48 C 

ATCM 31 8 C LEP A 23 70.828 5.971 42.334 1.00 28.87 £ 

40 ATSM 319 Q LEP A 23 71.288 5.218 43.165 1.00 30.79 £ 

ATOM 320 CB LEU A 23 70,036 5.107 40.089 1.00 22.72 £ 

ATCM 321 CG LEV a 2? 69,966 1,072 39.659 1,00 26,16 £ 

ATCH 322 CP1 LEV A 23 39,271 3.083 39.191 1 T QQ 21,80 £ 

Aids 323 CP2 LEV A 23 68,427 3.291 40,935 l T op 22,91 £ 

45 AlOS IZA U GLV A 24 71.279 7. 192 42.153 1.00 28.77 H 
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ATCM 325 CA GLU* A 24 72,419 7.675 42.909 1.00 33.79 £ 

AI£M 32$ c CLV A 24 72,363 7,399 44.412 1,QQ 35,94 £ 

ATCM 327 9 CL? A 24 73,391 7, HQ 45,031 l.QQ 3? T Q7 o 

AlfiM 328 CB GLU A 24 72.647 9.165 42. 653 1.00 36.21 £ 

5 ATOM 329 CG GLU A 24 7 4,Q69 9.482 42.243 1.00 42.54 £ 

AlflH 330 CD GLU A 24 74.158 lQ t $89 41,333 1.9Q 99,51 £ 

ATCM 331 0E1 <a? A 24 73,386 11-663 41,519 1,00 1? T 21 Q 

AI£M 332 <?B2 QrLV A 24 74.994 19,646 49,399 1,99 66,28 q 

AlSM 333 N GLW A 25 71.192 7.422 45.909 l.QQ 45,79 II 

10 ATCM 334 CA GLN A 25 71.039 7.152 46.432 1.00 47.57 £ 

AT£M 335 C GIN A 25 70.887 5.669 46.740 1.00 67.34 £ 

ATCM 336 o gln, a 25 7Q.295 5,296 47,726 1,99 74.96 2 

AICM 337 CB GLN A 25 69 t 793 7.842 46.905 1.00 51.85 £ 

AlfiM 338 cg GLjf ft 25 69,590 9,991 46.1 99 1, 99 4 4 .91 £ 

15 ATfli 339 CP gln, ft 25 69.419 9.913 46,742 l,Q0199,9O £ 

A1SM 340 QEl GLN A 25 68.271 9.947 47.972 1.00100.00 O 

AlfiM 341 NE2 GLN A 25 67.624 10.602 45. 911 1.00100.00 N. 

ATOI 342 N ARG A 26 71.322 4.831 45.825 1.00 75.37 U 

ATOM 343 CA ARG A 26 71.182 3.407 46.026 1.00 74.87 £ 

20 ftEM 3^1 £ ARG ft 26 72,569 2.791 46.147 1.00 74.08 £ 

AISM 345 Q ARg A 26 73,449 2,997 45.299 l T 99 77.99 2 

AIGM 346 cb AW? A 26 79,399 2,799 44,995 i T QQ 52.44 £ 

ATOM 347 £g arc ft 26 69.9*6 2,927 45,079 1,99 43.51 £ 

AlfiM 3.4J CP ftRG a 26 69,429 1.752 45,964 1.99 40-79 £ 

25 ATOM 349 NE ARG A 26 67.200 1.176 45.338 1.00 42.33 H 

AIOI 350 CZ ARG A 26 67,126 0.508 44.196 1.00 32.07 £ 

Ami 351 NH1 ARG A 26 68.215 0.324 43.486 1.00 44.02 H 

ATOM 352 NH2 ARG A 26 65.968 0.017 43.771 1.00 77.32 H 

AlfiK 353 m gltc ft 27 72,779 2,114 47,266 l-QQ 46,30 u 

30 aish 351 CA SLY A 27 74.960 1,531 47,519 l,Q9 46.82 £ 

AlOti 355 c gl,y A ?? UulAQ 9.165 46.923 1,90 55,45 £ 

ATCM 356 O GLY A 27 75.204 -0.453 46.877 1.00 64.43 2 

AEM 357 N ASP A 28 73.017 -0.315 46.428 1.00 40.98 N 

AXCH 358 CA ASP A 28 73.016 -1.647 45.861 1.00 40.35 £ 

35 axoi 359 C ASP A 29 73.?66 -1,536 44,400 1,99 39,55 £ 

Aiai 369 Q ASP A 28 73.199 -2.519 43,654 l.QQ 49, BQ Q 

ATCM 361 CB AffP A 28 71,690 -2,335 46,127 l.QQ 17, 9Q £ 

Aiai 362 CG ASP A 28 70.503 -1.373 46. 064 l.QQ 35.34 C 

ATCM 363 OD1 ASP A 28 70.705 -0.140 46.095 l.QQ 39.23 Q 

40 364 OD2 ASP A 28 69.383 -1.870 45.872 1.00 69.86 Q 

AXCtt 365 N VAL A 29 73.651 -0.329 43.996 1.00 31.03 H 

AISM 366 CA VAL A 29 73.991 -9,959 42.591 1,99 29,44 £ 

ATai 367 C VAL A 29 75,166 Q,676 42,291 1,09 29,99 £ 

Alas 368 o VAL A 29 75.505 1,699 42,892 l.QQ 34-93 Q 

45 ATCM 36 9 CB VAL A 29 72.696 0. 760 42.000 1.00 30.68 £ 



WO 99/64618 



PCT/US99/11576 



90 



ATOM. 370 CGI VAL A 29 72.335 1.089 40.549 1.00 23.65 £ 

AlCM 371 CC2 VAL A 29 71. 11$ -Q.029 42.156 i,qq 27. 95 £ 

372 N CLV A 30 75,824 0,219 41.230 l,pp 39.76 

AiaJ 373 ca glv ft 30 76.995 Q,924 40.736 1,00 29,38 £ 

5 ATOM 374 C GLU A 30 75.678 1.471 39.332 1.00 31.03 £ 

ATOM 375 O GLU A 30 76.368 0.720 38.3 97 1.00 26.64 Q 

ATCM 376 CB GLU A 30 78.199 0.006 40.722 1.00 31.84 C 

ATOM 377 CG GLU A 30 79.355 0.539 41.533 1.00 89.26 £ 

Aiai 22S CP GL.U A 3Q 80,667 0,264 4Q.959 l,P01Q0.pO £ 

10 ATSU 379 OE1 GLU A 30 81.082 -0.922 40.872 1.00 88.94 £ 

AXSM 380 OE2 GLU A 30 81 , 202 1.206 40.219 1.00100.00 Q 

AXfiM 381 N LED A 31 76.665 2.789 39.207 1.00 22.24 N 

ATCM 382 CA LEU A 31 76.269 3.391 37.945 1.00 29.37 £ 

AlfiM 383 C L EU A 31 77,4 .04 3.507 36 ,9 41 1,QQ 25. 79 £ 

15 ATCM 384 o EE? a 31 78.485 3.969 37.256 l t QQ 2?.41 o 

AICM 385 g B LEV A 31 75.632 4.769 39.191 l.QO 3P,20 £ 

AXCM 386 £G LEV A 3; 74,329 4, t 76? 39 t ?94 1,00 29,37 £ 

AXCli 387 CD1 LRU A 31 73,841 6.143 39.240 1.00 23.43 £ 

ATCM CD2 LEU A 31 7?. 275 3,962 38,291 1-00 23.04 £ 

20 ATOM 389 N VAL A 32 77.146 3.100 35.711 1.00 21.94 

AXCM 39Q CA VAL A 3? 78,143 3.265 34,695 i.OQ 25,48 £ 

ATO M 39 1 c VAL A 3? 77,5 35 4.2 42 33.669 1-QQ 39. 76 £ 

AiaS 392 9 VAL A 32 76,42? 3,999 33,190 1,P0 29,70 Q 

AXSM 393 cp val A 32 78,517 1.9P2 34.055 l,oo 34,25 £ 

25 ATOM 394 CGI VA L A 32 79.587 2.079 32.9 70 1.00 30.56 £ 

ATOM 395 CG2 VAL A 32 79.003 0.950 35.139 1.00 25.27 £ 

ATCM 396 H LEU A 33 78.219 5.375 33.457 1.00 30.19 H 

ATCM 397 CA LEU A 33 77.732 6.463 32.621 1.00 22.71 £ 

ATCM 398 C LEU A 33 78.727 6,979 31,6 45 1-00 29.55 £ 

30 ATOM 399 O LEU A 33 79.896 7.152 31.989 l.QO 30-09 Q 

AX01 4QQ CB LBV A 33 77.423 7 t 6?5 33,514 l t QP 19,75 c 

Aiai 4Q1 C<? LEV A 33 76 T 72P 7.2Q0 34,779 l.QO 19,38 £ 

AXSM 402 CDl LEU A 33 76.814 8.344 35.762 1.00 27.24 £ 

ATCM 403 CD2 LEU A 33 75.271 6.913 34.444 1.00 22.07 £ 

35 ATOM 40 4 N ARG A 34 78.239 7.421 30.496 1.00 15.09 

A1SM 405 CA ARG a 33 79,154 9,008 29.541 1,90 26.04 £ 

ATOM 3Q6 g ARC A 34 79,4,69 9,173 29,916 1,00 36.57 £ 

Aiai 407 Q ARff A 34 77.288 9,130 29.651 1-PO 38,59 q 

ATOM 408 CB ARG A 34 79,486 7.048 28.398 1.00 22.89 £ 

40 ATCM 409 CG ARG A 34 80.579 6.081 28.706 1.00 23.29 £ 

ATOM 410 CD ARG A 34 81.370 6.575 29.860 1.00 52.06 £ 

Aiai 411 WE ARC A 34 81.793 5.458 30.711 1,PP 80.25 U 

ATOM 412 C2 ARG A 34 82.646 4.530 30.323 l.QO 41.94 £ 

A12M 413 NH1 ARG A 34 83.173 4.596 29.104 1.00 53.02 £ 

45 Aiai 414 WH2 ARG A 34 82.983 3-547 31.148 l.QO, 25.56 H 
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ATCM 415 M THR A 35 79.248 10.156 28.539 1.00 31.58 & 

bl£X 416 CA THR A 35 78.703 11.282 27.833 1.00 29.33 £ 

auai 417 c TH R A 35 79,71? 1Q.951 26,349 l r QQ 32,53 £ 

ATCM 41B O THR A 35 79.350 9,944 25,962 l.PQ 28.08 2 

5 axoj 419 CB THR A 35 79,527 12,527 29.115 1,99 37,4? £ 

ATCM 420 XI THR A 35 80.844 12 .429 27.560 1.00 31.91 Q 

AXCM. 421 CG2 THR A 35 79.627 12.642 29.651 1.00 19.38 £ 

AXC* 422 W ARE A 36 7?, 032 11.799 25,529 i T pp 39.92 N 

AXCH 423 £A ARG a 36 78.992 11,639 24,956 1,99 29,37 £ 

10 ATCM 424 g ARg A 36 79.49$ 11,765 23,593 1-99 31,46 £ 

ATCH 425 o arg A 36 79,772 11,912 22,591 1,99 36.56 o 

AXCM. 426 CB ARG A 36 77-954 12.650 23,354 1.90 37,34 £ 

ATJQM 427 CG ARG A 36 76.937 12.465 21.846-99.00 49.47 £ 

ATCM 428 CD ARG A 36 76.020 13.515 21.232-99.00 63.09 £ 

15 ATCM 429 NE ARG A 36 75.528 13.124 19.915-99.00 75.23 K 

ATCM 439 eg ARG a 36 74,381 13,549 19.391-99.99 91.44 C 

ATCM 431 HH1 ARG A 36 73.605 14.375 20.079-99.00 79.32 H 

AXOJ 432 MH2 ARG a 36 74,999 13,144 19,185-99,99 79.73 H 

ATCM. 433 N ASP A 37 80.217 12. 677 24.063 1.00 41.30 H 

20 ATCM. 434 CA ASP A 37 81.606 12.710 23.601 1.00 44.91 £ 

AXCM. 435 C ASP A 37 82.410 11.481 24.043 1.00 24.99 £ 

ATCM 436 O ASP A 37 83.211 19, 978 23 ,261 1,09 4 2,22 Q 

ATCM 437 CB ASP A 37 82.347 14.048 23.718-99.00 47.07 £ 

ATCM 438 CG ASP A 37 81.881 14.887 24.876-99.00 62.99 _£ 

25 ATCH 439 OD1 ASP A 37 80.679 14.839 25.204-99.00 64.45 O 

ATCM 440 OD2 ASP A 37 82.711 15.638 25.429-99.00 69.84 Q 

ATai 441 N G LU A 38 82.129 10.950 25.235 1.00 19.39 N 

ATCM 442 CA GLU A 38 82.790 9.717 25.682 1.00 27.84 £ 

ATCM 443 C GLU A 38 82.203 8.527 24.901 1.00 37.14 C 

30 AX£M 444 o glu a 39 92,973 7.511 24.699 1,99 35,94 Q 

ATCM 445 CB GLU A 38 82.691 9.435 27.207 1.00 25.18 £ 

ATCM 446 CG GLU A 38 83.116 10.549 28.163 1.00 37.45 £ 

AXCH 447 CP giV a 3B 92.897 19,212 29.655 1,99 21,13 £ 

ATCM 448 OE1 GLU A 38 81.623 9 T ?97 30,91 4 1.00 55.97 O 

35 atch 449 OE2 glu a 38 93.757 9.978 39.419 1,99 99.79 Q 

ATCM 450 N LEU A 39 80.948 B.610 24.47B 1.00 25.52 £ 

Aiai 451 CA LEV A 39 89-449 7,493 23.739 1,99 18,17 £ 

ATCU 452 C LBV a 39 79,291 7,7$4 22.825 1,99 20,34 £ 

AICM 453 O LEU A 39 78.152 7.810 23.259 1.00 26.35 Q 

40 ATCM 454 CB LEU A 39 99,123 6.313 24.657 1.00 14.56 C 

ATOM 455 CG LEU A 39 79.410 5.075 24.058 1.00 19.52 £ 

AJSM 456 CD1 LEU A 39 80.205 4.392 22.994 1,99 19,84 £ 

ATCM 457 CD2 LEU A 39 78.890 4.051 25.084 1.00 17.41 C 

ATCM 458 N ASN A 40 79.598 7.880 21.543 l T 0p 16,73 H 

45 ATCM 459 CA ASN A 40 78.548 7. 971 20.540 1.00 21.55 £ 
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AEM 160 C ASM A 40 77,79 9 6.649 20.308 1.00 24.53 £ 

AXCfl 16J Q ASN A 40 79.328 5.720 19.688 1.00 19.96 Q 

ATOM 462 CB ASN A 40 79.130 9.367 19.216 1.00 18.45 £ 

AXSM 16? £C ASH A 10 78.051 8.727 18.225 l t op 12,19 £ 

5 Aiai 164 9pi APH A 10 79.327 9,993 17,989 1,99 ?a T 89 2 

AXC£ 465 ND2 A SM A 40 76.827 9,73 9 18.697 1.00 23.71 H 

&X£H 166 H LEO A 41 76.543 6.622 20.754 1.00 21.08 N. 

ATCM 467 CA LEU A 41 75.649 5.465 20.650 1.00 15.03 £ 

AlfiM ifcfi £ LOT ft 11 75.225 5,068 19,21? l-QQ 18,22 £ 

10 AISM 16? 9 LBV A 11 71,681 3.971 18.980 1.99 15,72 Q 

A3SI1 17Q cp lev A 11 71-126 5.795 21.532 ItQP 15r85 £ 

AX£H 171 Cg LEV A 11 71.922 6,929 22.971 l.QQ 21.90 £ 

AX34 472 CDl LEU A 41 73,694 6.413 23.749 1.00 20.59 £ 

Aiai 473 CD2 LRU A 41 75.481 4.796 23.609 1.00 17.97 £ 

15 A131 171 N LEU A 42 75.512 5.916 16.238 1.00 12.45 (J 

AlflM 475 CA LEU A 42 75.256 5.607 16.831 1.00 15.99 £ 

AXOI 176 c LBV A 12 76,299 1,689 16,280 1,09 26,19 £ 

AXSM 177 Q LBV A 12 76.066, 1.939 15,257 i,Q0 22,11 q 

ATOM 4±S CB LBV A 1? 75,282 6r973 15.991 1,99 17,85 £ 

20 AXOI 479 CG L EU A 42 74.180 7.854 16.399 1.00 30.70 £ 

AEM 480 CDl LEU A 42 71.318 9.184 15.704 1.00 24.31 £ 

AX£H 481 CD2 LEU A 42 72.764 7.241 16.208 1.00 31.13 £ 

Aiaj 482 N ASP A 43 77.462 1.705 16,911 l.QQ 26.87 H 

AX£H 483 CA ASP A 43 78.579 3 ,975 16.186 1,99 19.29 £ 

25 ATCH 191 C ASP A 43 78.583 2.519 17.163 1.00 13.33 £ 

AX23 195 o ASP a 79,051 2,348 19,297 l r 99 18.75 Q 

AlfiM 486 CB ASP A 43 79.870 4.580 16.776 1.00 31.06 C 

Aiaj 487 CG ASP A 43 81.083 3.758 16.380 1.00 30.68 C 

AI23 488 OD1 ASP A 43 80.971 2.551 16.082 1.00 32.36 Q 

30 ATOM 489 OD2 ASP A 43 82.187 1,399 16.499 1.00 37.83 Q 

AXGM 199 W SERA 44 78.139 1.544 16,377 1.99 16.89 M 

A1SM 191 CA sera 14 77,979 9,173 16.799 1,09 17,67 £ 

AXOI 492 C SERA 44 79.237 0.463 17.392 1.00 20.40 £ 

ATCM 193 o ser a 44 79,296 -1,126 19,111 1,90 26,2? o 

35 Aiai 494 CB SERA 44 77.504 -0.617 15.581 1.00 13.85 £ 

Aiai 495 OG SER A 44 76.800 -1.740 16.063 1.00 43.83 Q 

ATOM 496 N ARG A 45 80.335 -0.301 16.682 1.00 15.63 N. 

ATOM 497 CA ARG A 45 81.616 -0.789 17.154 1.00 19.94 £ 

AlfiH 199 C ARC A 15 91,919 -0,225 18.521 1,90 29,18 £ 

40 Aiai 499 O ARG A 45 82.244 -0.937 19.45 7 1.00 27.65 Q 

AXSU 500 CB ARG A 45 82.684 -0.261 16.203 1.00 27.46 £ 

ATCM 501 CG ARG A 45 83.463 -1.338 15.495 1.00 92.03 £ 

AX£M 502 CP ARG A 15 91.954 -1,119 16.977 1-99199.99 £ 

AEW 593 HE ARC A 15 95.636 -2.533 15.527 1,09190-99 K 

45 ATCM 504 CZ ARG A 45 86.092 -3.570 16.236 1.OO100.00 £ 
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ATOM 505 MH1MSA 45 85.791 -3.695 17.547 1. 00100.00 M 

ATCM 506 KH2 ARC A 45 86.773 -4.S44 15.642 1,90199,90 H 

ATCM 507 H ALA A 46 81.772 l.Qgg 16.62? 1.90 31,04 M 

ATCM 508 CA ALA A 46 82.045 1.743 19.881 1.00 24.72 £ 

5 ATCM 509 C ALA A 46 81.111 1,176 20,899 1,00 17,73 £ 

ATCM 510 O ALA A 46 8J^12 0.825 22,027 l.QQ 22,73 2 

ATCM 511 CB ALA A 46 81.839 3 T 221 19. 751 l.QQ 27.16 £ 

ATCM 512 H VAL A 47 79.835 1,119 20.531 1,00 17.54 H 

ATCM 513 CA VAL A 47 78.878 0,608 21,506 1,00 21.41 £ 

10 ATCM 514 C VAL A 47 79.262 -0.812 21.914 1.00 30.25 £ 

ATCM 515 O VAL A 47 79.192 -1.202 23.097 1.00 15.85 Q 

ATCM 516 CB VAL A 47 77.470 0.668 20.9B9 1.00 18.59 £ 

ATCM 517 CGI VAL A 47 76.503 Q.Q42 22,01? 1-0P 16 T BB £ 

ATCM 518 CG2 VAL A 47 77.115 2.096 20.756 1.00 16.28 £ 

15 ATCM 519 N HIS A 48 79.692 -1.585 20.920 1.00 21.00 H 

ATCM 520 CA HIS A 48 80,Q29 2,969 21.192 1,00 20.17 £ 

ATCM 521 C HIS A 48 81.268 -3.079 22.117 1.00 32.98 £ 

ATCM 522 Q HIS A 48 81.289 -3 .850 23.102 1.00 28.20 Q 

ATCM 523 CB HIS A 48 80.063 -3.801 19.855 1.00 14.93 £ 

20 ATOM 524 CG HIS A 48 78.686 -4.172 19.339 1.00 26.67 £ 

ATCM 525 KD1 HIS A 48 78.085 -5.394 19.600 1.00 28.83 H 

ATCM 526 CD2 HIS A 48 77.758 -3.448 19, €59 1-00 25,56 £ 

ATCM 527 CE1 HIS A 48 76.887 -5.430 19.043 1,00 2PtQ8 £ 

ATCM 528 ME2 HIS A 49 76.660 -4,260 10,4,75 l.PQ 25.22 M 

25 ATCM 529 N ASP A 49 8 2.217 -2.170 21.902 1.00 22.62 H 

ATCM 530 CA ASP A 49 83.455 -2.169 22.674 1.00 24.23 C 

ATCM 531 C ASP A 49 83.171 -1.899 24.122 1.00 38.72 £ 

ATCM 532 O ASP A 49 83.708 -2.551 25.027 1.00 35.44 S 

ATCM 533 CB ASP A 49 84.396 -1.112 22.127 1.00 30.29 £ 

30 ATOM 534 CG ASP A 49 84.991 -1.503 20.775 1.00 52.45 £ 

ATOM 535 OD1 ASP A 49 85.007 -2.726 20.449 1.00 42-67 Q 

ATCM 536 OD2 ASP A 49 85.416 -0.587 20. 029 1.00 73.76 Q 

atcm 537 k php. a 50 82.294 -Q.929 24.324 1,00 32.19 H 

ATOM 538 CA PHE A 50 91.902 -0.550 25.649 1.00 29.76 £ 

35 ATCM 539 C PHE A 50 81.299 -1.765 26.359 1.00 30.31 £ 

ATCM 540 Q PHE A 50 91.715 -2.124 27.449 1.00 29.22 Q 

atom 541 cb phe a so 80.892 0.610 25.576. 1,00 23.82 £ 

ATCM 542 CG PHE A 50 80.137 0,84? 26.859 1,00 19,13 C 

ATCM 543 CD1 PHE A 50 80.740 1.515 27.931 l.QQ 20.14 £ 

40 ATCM 544 CD2 PHE A 50 78.835 0.360 27.018 1.00 13.99 £ 

ATCM 545 CE1 PHE A 50 80.034 1.742 29.129 1,00 25.81 £ 

ATOM 546 CE2 PHE A 50 78.114 0.553 28.212 1.00 22.84 £ 

ATCM 547 C7. PHE A SO 78.698 1,276 29t2^ 1.00 23.40 £ 

ATCM 548 W PHE A 51 80.280 -2. 367 25.768 1 T 0Q 21.75 U 

45 atcm 549 ca phe a 51 79,655 -3,451 26,457 1.00 22.61 £ 
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Ala! 595 Cgl ILE A 56 77,643 -2.391 31,442 1,90 19.39 £ 

arai 5?$ rez ilb a se 75,214 -3 t qi6 31,549 1,99 19.94 £ 

axcm 597 cdi ile a se 77.998 -1.936 30. 026 1.00 60.42 & 

AEM 598 M ASP A 57 75.166 -5.133 34.237 1.00 16.84 H 

5 AlOi 599 CA ASP A 57 74.040 -5.999 34.630 1.00 16.33 £ 

&X£H €00 c Asp a 57 72.676 -5.451 34.123 1,99 29,49 £ 

AXCM 6JU Q ASP A 57 71.836 -6.198 33.657 1.00 25.50 Q 

AXEM 602 CB ASP A 57 74.009 -6.194 36.164 1.00 16.94 £ 

Aiaj 693 Cg ASP ft 57 75.369 -6.720 36.703 1.00 34.27 £ 

10 AXOi 604 QD1 ASP A 57 75.875 -7.729 36.1 41 1.00 31.76 Q 

ARB 605 OD2 ASP A 57 76.040 -6.007 37.499 1.00 28.36 Q 

ATOM 606 M GLN A 58 72.443 -4.152 34.220 1.00 28.91 K 

ATOM 607 CA GLN A 58 71.183 -3.590 33.755 1.00 25.68 C 

Aiaf 608 C GLN A 58 71. 425 -2.364 32,881 1,Q0 ?? t 21 £ 

15 AXOi 6Q? g GLM A 58 72,493 -1,629 33.967 1,99 19,16 q 

AISM 619 cp GLN A $8 79.312 ~3,151 34,946 1,QQ 33,14 £ 

ATCH 611 CG GLN A 58 69.798 -4.241 35.807 1.00 30.00 £ 

AISM 612 CD GLN A 58 69.226 -3.712 37.105 1.00 27.18 £ 

AXSM 613 OE1 GLN A 58 68.722 -2. 601 37.161 1.00 31.20 Q 

20 A1£M 614 NE2 GLN A 58 69.455 -4.436 38.18 6 1.00 16.89 N. 

Aiai 615 N VAL A 59 79.496 -2,139 31.961 1,99 19.35 H 

AKM 616 CA VAL a 59 79.562 -9,999 31.915 1-99 15.59 £ 

ATOM 617 C VAL A 59 69.238 -0.240 31.039 1.00 26.28 £ 

A1SM 619 Q VAL A 59 68.178 -0.820 30.762 1.00 19.51 2 

25 ATCH 619 CB VAL A 59 70.707 -1.456 29.601 1.00 15.32 C 

ATSM 620 CGI VAL A 59 70.477 -0.274 28.649 1.00 11.93 £ 

ATOM 621 CG2 VAL A 59 72.080 -2.111 29.364 1.00 15.83 C 

ATOM 62 2 N TYR A 60 69.306 1.064 31.293 1.00 21.71 £ 

ATOf 623 ca m A 69 69,113 1,927 31,197 1,90 21,4 ,0 £ 

30 ATCH 6 2 4 C TYR A 60 69 .299 2.756 29.928 1.00 18.69 £ 

625 Q TYR A 69 69,259 3,532 29.796 1,99 15,51 Q 

AXO* 626 CB TYR A 60 69, 9 2; 2.817 32.413 1.00 17.24 £ 

ATCK 627 CG TYR A 60 67,493 2.131 33.658 1.00 19.71 £ 

ATOM 628 CDI TYR A 60 68.345 1.583 34.586 1.00 21.14 £ 

35 aich 629 CD2 TYR A 60 66,1 5 4 2,223 33.991 1,99 29,16 c 

AXCH 639 C El TYR A 60 67,93 5 1.080 35.794 1.00 19.11 £ 

AXOi 631 CE2 TYR A 69 65.649 1.698 35.163 l.oo 10.77 c 

AEM 632 CZ TYR A 60 66-476 1.094 36.054 1.00 20.07 £ 

ATOM 63 3 OH TYR A 60 65.921 0.585 37.248 1.00 16.04 O 

40 ATOM 634 N LEU A 61 67.491 2.452 28.916 1.00 17.46 H 

AlfiM 635 CA LEU A 61 67.685 3.053 27.585 1.00 20.17 £ 

AEM 636 C LEV A 61 67.993 4.412 27.499 1,99 23,36 £ 

AJOi 637 Q LBV A 61 65.925 4.526 26,799 1,09 11,96 Q 

AXCH £3j CB LEU A 61 67.267 2.060 26.495 1.00 14.78 C 

45 ATOM 639 CG LEU A 61 68.117 2.142 25.299 1.00 15.52 C 
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ATOM 640 GDI LEU A 61 67.915 1.010 24.109 1.00 7.75 £ 

641 CP2 Lgy A 6J, 59,097 3.54,1 24,599 1-00 15,20 £ 

fiX£M 642 N AIA A 62 67.656 5.434 27,956 1,QQ 29,35 JJ 

AXCM 643 CA AIA A 62 67.120 6.784 27.963 1.00 18.55 £ 

5 AJKH 644 C AIA A 62 67.779 7.739 26.949 1.00 18.57 £ 

AICM 645 o aia a 62 67.455 9,924 26,920 1,99 24.31 2 

aiOi 646 CB ALA A 62 67.971 7,377 2?. 43? l t QQ 11,69 £ 

ATQH 64 7 N AIA A 63 68.681 7.?3; 26,1 91 1,00 14-09 U 

AXCM 648 CA AIA A 63 69.249 B t Q95. 25.952 1-99 12.94 £ 

10 AXSM 649 C AIA A 63 68.310 8.005 23.877 1.00 27.00 C 

AJ£M 659 9 AIA A 63 67.945 6.916 23.511 1.00 24.51 Q 

ATOM 651 CB AIA A 63 70.665 7.660 24.634 1.00 4.89 £ 

ATOM 652 N AIA A 64 68.076 9.148 23.262 1.00 21.05 H 

ATOM 653 CA AIA A 64 67, 29? 9, 296 22 .986 1, 99 13.59 £ 

15 ATOM 654 C AIA A 64 67.435 10.664 21.416 1.00 28.08 £ 

ATSE 655 Q AIA A 64 67,997 11,609 22t921 1.99 26.63 Q 

ATCH 656 CB AIA A 64 65.642 9.171 22.518 1 ,P0 7. 63 £ 

ATOM 657 W LYS A 65 66.953 10.781 20.182 1.00 23.98 N 

Aiai 658 CA LY? A 65 66.966 12.012 19.409 1.00 20.47 £ 

20 ATOM 659 C LYS A 65 65.488 12.443 19.551 1.00 24.37 C 

ATOM 669 9 LYS A 65 64,594 11.997 19.976 1-99 29.2? Q 

661 CB L ys A 65 67 .31 7 11,6 59 1 7 .951 1,90 25. 59 £ 

AXOi 662 CG LYS a 65 66.808 12,639 16.923 1 T PP 27,54 £ 

AXSW 663 CD LYS A 65 67.518 13.926 17.169 1.99 21.99 £ 

25 ATCM 664 CB LYS A 65 67.316 14.905 16.029 1.00 55.15 £ 

AI21 665 NZ LYS A 65 67.876 16.263 16.392 1.00 81.63 H 

AI£M 666 N VAL A 66 65.22B 13.362 20.485 1.00 22.47 K 

ATOM 667 CA VAL A 66 63.873 13.850 20.755 1.00 18.99 £ 

AXO} 668 C VAL A 66 63.711 15.343 20.394 1.00 31.44 C 

30 ATCM 66? o VAL A 66 64.665 16.197 29.469 1,99 34.61 Q 

ATOa 679 CB VAL A 66 63,440 l? T 62? 22.294 l.PQ 16,66 £ 

ajjjj 671 CGI VAL A 66 64.269 12.623 22.969 1.99 15.91 £ 

ATOM 672 CG2 VAL A 66 63.379 14.904 22.950 1.00 19.21 C 

AXaa 673 M GLY A 67 62.514 15.755 19.994 1.00 18.03 £ 

35 AXCM 674 CA SLY A 67 62,299 17,119 19.614 1,99 14, ?0 £ 

AXOj 675 £ gly a 67 69.792 17 T 519 19,585 l f QQ 32.35 £ 

aiSM 676 9 gly A 67 59.922 16-666 19-988 l t Qp 19. 98 o 

ATSH 677 W gLY A 69 69.593 19-7B7 19.256 1.99 23.21 H 

ATCM 678 CA GLY A 68 59.132 19.2B8 19.183 1.00 23.83 £ 

40 ATOM 679 C GLY A 68 58.540 19.137 17.771 1.00 19.31 £ 

AXOH 680 O GLY A 68 59.165 18.550 16.870 1.00 30.64 O 

A2SM 691 W I LB A 69 57. 3 43 19-69 4 1 7. 5 99 1,99 15,20 H 

AXSM 692 CA ILE A 69 56.595 19.632 16.317 1,99 16.99 £ 

Aiaj 693 C ILE A 69 57.397 29-153 15,112 1,99 19.3? £ 

45 AlfiM 684 Q ILEA 69 57.425 19.5 19 14.061 l.OO 14.66 O 
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AICH 68 5 CB ILE A 69 55.257 20.432 16.480 1.00 30.11 £ 

AXfiM 686 CS1 ILE A 69 54.271 19.683 17.385 1.00 24.27 C 

AXStS $87 CG2 I LB A 6? 54.610 20.749 15.18 1 1.00 47.53 £ 

Ala! 698 CP1 ILE A 69 53.259 20.608 18.05 6 1.00 85.71 £ 

5 689 N VAL A 70 58.010 21.327 15.269 1.00 23.03 H 

AI£M 690 CA VAL A 70 58.797 21.913 14.183 1.00 19.34 £ 

AXOf 6gl £ VAL A 70 59.983 21.011 13.840 1.00 24.42 £ 

AX£M 692 O VAL A 70 60.335 20.829 12.662 1.00 24.14 Q 

AI£H 693 CB VftL A 79 59.304 23.404 14.467 1.00 21.37 £ 

10 AI£H 694 Cgl VAL A 79 60.137 23.907 13.28 1 1.00 17.79 £ 

Aiai 695 g<?2 VAL A 79 58.136 24.410 14.67 8 1.00 15.74 £ 

AICK 696 N ALA A 71 60.621 20.450 14.86 1 1.00 19.68 H 

AIOl 697 CA ALA A 71 61.782 19. 617 14.572 1.00 16.57 £ 

Aiai 699 C ALA A 71 61.427 18. 289 13.910 1.00 23.36 £ 

15 ATW 699 9 ALA A 71 61.980 17.923 12.849 1.00 21.84 Q 

AISM 700 £B ALA A 71 62.685 19.439 15.805 1.00 9.36 £ 

ATCM 701 H ASK A 72 60.463 17.598 14.511 1.00 16.80 N. 

ATCM 7P2 CA ASH A 72 59.998 16.357 13.92 3 1.00 18.84 £ 

AISM 703 C ASN A 72 59.608 16. 539 12.440 1.00 23.87 £ 

20 ATCtt 7Q4, Q ASN A 72 59.919 15.696 11.593 1.00 21.52 Q 

ATCM 705 CB ASN A 72 58.835 15.806 14.738 1.00 8.60 £ 

AXCM 796 £g ASH A 72 59.309 15.013 15.91 1 1.00 23.75 £ 

ATOM 7Q7 QP1 ASM A 72 59.558 13.809 15.810 1.00 23.98 Q 

ATOi 7Q8 ffiP2 ASH A 7? 59.572 15.701 16.996 1.00 9.96 N. 

25 Alas 799 W ASH a 73 58,931 17,647 12.139 l T 99 23,97 H 

ATCM 719 PA ASH A 73 58.521 17.971 10.761 1.00 26.05 £ 

AISM 711 C ASN A 73 59.665 18.454 9.817 l.OO 26.95 C 

ATOM 712 O ASN A 73 59.613 18.276 8.569 1.00 22.13 Q 

ATCM 713 CB ASN A 73 57.383 19.001 10.800 1.00 14.86 £ 

30 ATPM 714 CG ASN A 73 56.015 18.349 10.987 1.00 19.88 £ 

AI£M 715 OP1 ASN A 73 55.620 17.468 10.217 1.00 27.02 O 

Aiai 716 HP2 ASN A 73 55,332 19,732 12,951 1,99 29,78 H 

ATCM 717 H THR A 74 60.710 19.029 10.419 1.00 18.69 H 

ATOM 718 CA THR A 74 61.845 19.540 9.657 1.00 10.07 £ 

35 AI£H 719 C THR A 74 62.968 18,548 9.375 1.00 21.00 £ 

M£M 729 Q THR A 74 63.537 18.561 8.289 1.00 11.75 O 

AT£M 72^ CB THR A 74 62.411 20.746 10.306 1.00 29.10 C 

Aiai 722 OG1 THR A 74 61.370 21.714 10.457 1.00 23.24 O 

ATcm 723 cg2 THR A 74 63.541 21.299 9.152 1-9P 21,63 £ 

40 ATJ2M 724 N TYR A 75 63.230 17.636 10.310 1.00 17.10 tf 

AI£M 725 CA TYR A 75 64.267 16.620 10.112 1.00 9.07 C 

ATCM 726 C TYR A 75 63.733 15.203 10.318 1.00 6.17 £ 

ATSU 727 9 TYR A 75 64.143 14.542 11.267 1.00 15.58 Q 

AXSM 729 CP TYR A 75 65.392 16,925 11.199 l.QQ 11.89 £ 

45 AlfiM 729 CG TYR A 75 65.779 IB. 234 11.252 1.00 27.1? £ 
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AISM 225 H HRA 81 56,151 7,53? 11.659 1,00 11,18 U 

ATCM 776 CA TYR A 81 67.486 6.902 11.630 1.00 15. 06 £ 

ATCM 777 C TYR A 61 68.237 6.782 12.959 1.00 16.83 £ 

AISM 779 P TYR A 81 68.714 5.702 13.383 1.00 16.74 Q 

5 ATOM 779 CB TYR A 81 68.384 7.599 10.616 1.00 9.43 £ 

AICH 7 8 0 eg TY R A 81 $ 9 .74 9 6,966 10,541 1,00 22.54 £ 

ATCH 781 CD1 TYR A 81 69.963 5,824 9.747 1.00 22.37 £ 

AEM 782 CP2 TYR A 81 70.818 7,46.6 11.299 1,Q9 18.07 £ 

AISM 783 CE1 T YR A 81 71.202 5.163 9.746 1.00 15.02 C 

10 A2£H 784 CR2TYRA 81 72.080 6.893 11.201 1.00 17.37 £ 

ATOM 785 CZ T YR A 81 72.255 5.698 10.472 1.00 24.27 £ 

A TOM 786 PP TYR A 81 73.491 5.063 10.409 l.QQ 19-57 £ 

ATCU 787 W SLN A 62 68,385 7 T 918 13.612 1.00 11.39 K 

ATCH 788 ffl (?HfA 82 69.193 7,930 14,910 1-09 12.23 £ 

15 ATCtt 789 c gln A 82 68,544 7.099 15,934 1.00 14.18 £ 

AISM 790 O GLN A 82 69.180 6.415 16.631 1.00 11.35 Q 

AISM 791 CB GLN A 82 69.280 9.354 15.291 1.00 18.73 £ 

ATCH 222 CG GLN A 82 69.986 10.209 14.250 1.00 13.54 £ 

ATOM 793 CD GLN A 82 70.285 11.617 14.736 1.00 26.00 £ 

20 ATOM 794 OE1 GLN A B2 70.410 11.850 15.927 1.00 22.99 £ 

ATCU 795 NE2 A 82 70.4P4 12-561 13,908 1.00 16.59 H 

ATCU 796 n asn a 93 67.235 7 f l81 15.869 1.00 11.35 K 

ATCH 797 CA ASN A 93 66.549 6.4Q8 16.960 1.00 13.71 £ 

AISM 798 £ ASH A 83 66.623 4.902 16.557 1.00 21.43 £ 

25 AISM 799 O ASN A 83 66.831 4.101 17.463 1.00 12.10 Q 

AlfiM 800 CB ASM A 83 65.132 6.945 17.074 1.00 13.51 £ 

AISM 901 C<? ASy A 93 6 5, 13 1 8,245 17.871 1.00 28.91 £ 

AISM 902 QPl ASN A 93 65.629 9,263 18.990 1.90 22.29 Q 

AISM 803 ND2 ASN A 83 64.756 9.354 17.237 1.00 20.17 N 

30 AISM 804 N MET A 84 66.592 4.517 15.290 l-OQ 15,63 N. 

AISM 805 CA MET A 84 66.704 3.101 15.007 1.00 15.66 £ 

AISM 806 C MET A 84 68.054 2.588 15.34 8 1.00 14.66 £ 

ATOM 807 O MET A 84 68.148 1.514 15.902 1.00 11.45 Q 

ATOM 808 CB MET A 84 66.418 2.815 13.563 1.00 17.59 £ 

35 AISM 809 C? WET A 94 64.911 2,894 13.22Q 1.00 14.40 £ 

AISM 610 gp met A 94 64.638 2.811 11.387 1.90 15.99 £ 

AEM 811 CE MET A 84 65.164 1.105 10.952 1.00 8.90 C 

AISM 812 W MET A 85 69.098 3.338 15.024 1.00 11.20 N_ 

AIC±f 813 CA MST A 85 70.469 2.879 15.321 l.QQ 11.67 £ 

40 ATCH 814 C MET A 85 70,779 2.831 16.774 1.00 13.04 £ 

AISM 815 o MET A 85 71.359 1.893 17.265 IrOO 15.26 £ 

AISM 816 CB MET A 85 71.525 3.798 14.693 1.O0 15.07 £ 

ATCH 817 £g MET A 95 71.530 3.726 13.173 1.P9 32.01 £ 

AISM 818 SD MET A 85 71.918 2.027 12.487 1.00 37.79 5. 

45 AISM 819 CE MET A 85 73.379 1.801 13.320 1.00 15.94 £ 
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atom ' 820 N ILE A 96 1SLA11 3.932 17,491 1,P9 13,92 H 

aiSM 821 CA ILE A 86 70.760 3.993 18.912 1.00 12.58 £ 

ATOM 822 Q ILE A 86 70,159 2 t 662 19.591 1.00 21.61 C 

ATCM 823 £2 ILE A 66 70.813 1,981 2P.3S2 1.00 18.68 £> 

5 Aiai 924 £B ILE A 9$ 70.225 5,199 19.6Q6, l.QQ 11,94 £ 

ATCM 925 cgj ILE A 96 7Q,978 6,4,29 19.11? l.QP 19,76 £ 

AISM 826 cs2 ILE A 96 7Q.435 5,132 21.112 1-QP 6,5? c_ 

ATCM 827 CD1 ILE A 8 6 70.505 7.694 19.772 1.00 20.37 £ 

ATCM 828 N GLU A 87 6 9,993 2.393 19.316 1.00 18.78 N. 

10 ATCM 829 CA GLD A 87 68.263 1.237 19.930 1.00 14.00 £ 

AXCtf. 939 C GLU A 87 68.797 -0.116 19.454 1.00 15.93 C_ 

ATOM 831 P GL? A 97 $9.017 -P.991 29.269 1 T QQ 11. Q4 Q 

AICM 832 CB glu a 97 66,734 1,324 19.9P9 1-QP 14.99 £ 

ATCM 833 CG GLU A 87 66.085 1,327 18 T 538 1.00 2B.96 £ 

15 A TCM 834 CD GLU A 87 64,635 1-922 19.544 1.00 11.12 £ 

ATOM 835 OE1 GLU A 87 64.307 2.801 19.376 1.00 25.46 Q 

ATCM 836 QE2 GLU A 87 63.845 1.547 17.663 1.00 29.87 Q 

ATCM 837 M SE RA SB 69.054 -0.259 18.155 1.00 16.18 & 

A1S2J 938 CA SEP, A 98 69.65Q -1,492 17,569 j T pp 19,52 £ 

20 ATCH 939 £ SBR A 99 71,929 -1,792 19.16P 1,PP 22,54 £ 

A1SM 94Q O PER A 99 71.313 -Z,?Z? 18,59? 1,QQ 13 t 80 Q 

ATCM 941 CB SERA 99 69,815 -1,326 16,023 l.PQ 14.61 £ 

ATCM 142 QG SER A 99 6.9,551 -1,291 15.355 i,qq j5,3l Q 

A1CH 643 N ASK A 89 71.8B4 -0.773 18.143 1.00 22.63 U 

25 ATCM 844 CA ASM A 89 73,227 -Q.B69 19,693 1 T QP 27,23 £ 

ATCM 845 C ASM A 89 73.195 -1.363 20.134 1.00 21.34 £ 

ATCM 846 O ASM A 89 73.795 -2.384 20.476 1.00 23.68 Q 

ATCM 847 CB ASH A 89 73.980 0.487 18.597 1.00 13.71 C 

ATCM 848 CG ASN A 89 74.440 0.825 17.168 1.00 20.40 C 

30 ATCM 849 OD1 ASM A 89 74.305 -0.006 16.255 1.00 14-93 Q 

ATCM B50 MD2 ASM A 89 74.937 2.067 16.960 1.00 13.32 N. 

ATCM 651 N ILE A 90 72.48B -0.646 20.979 1.00 16.55 N. 

ATCM 852 CA ILE A 90 72,437 -1.014 22,399 LOO 21,51 £ 

AT CM 85J £ ILE A 99 71,976 -2.421 22,729 1,99 26.50 . c 

35 ATOM 854 O ILE A 90 72.384 -3.159 23.590 1.00 19.71 Q 

ATCM 855 CB ILEA 90 71.670 0.070 23.233 1.00 13.32 C 

ATCM 856 CGI ILE A 90 72.539 1.299 23.401 1.00 11.05 C 

ATCM 857 CG2 ILE A 90 71.371 -0.445 24.637 1.00 7.54 £ 

856 CD1 ILE A 90 71.749 2.597 23,66B 1,PQ 2Q,71, £ 

40 ajsm 959 N ILE A 91 79,755 -2,733 22,114 1,PP 14,99 E 

AXSH 96P CA ILE A 91 79.P47 -3,953 22.442 1-QP 21,33 £ 

ATOM 861 C ILE A 91 70.927 -5.098 21.994 1.00 26.27 £ 

ATCM 862 0 ILE A 91 71-21; -6 T QH 22.751 l.PO 26,56 O 

ATCM 863 CB ILE A 91 68.556 -3. 930 21.814 1.00 20.39 £ 

45 ATCM 864 CGI ILE A 91 67.692 -2.8B6 22.552 1.00 13.51 C 
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ATOM 865 CG2 ILE A 91 67.841 -5.316 21.845 1.00 11.31 £ 

AXSM 866 CPl ILB A 91 66.320 -2.648 21.907 1.00 16.23 £ 

ATCM 867 N HIS A 92 71.446 -4.983 20.785 1.00 24.12 H 

ATCM 868 ca his a 92 72.2?? -$.015 20,243 i t QQ 26,71 £ 

5 Aicb 86? c HIS A 92 73,699 -6,251 21.071 1,90 29.39 c 

ilCM 870 0 HI S A 32 73.99? -7,366 21,443 1,90 19,58 Q 

AXO! 871 C8 HIS A 92 72.561 -5,692 19.775 l.PQ 22.23 £ 

AISM 972 Cg HIS A 92 73. 365 -6.720 18.Q77 1,99 26.32 £ 

Aiai 873 NDl HIS A 92 72.798 -7.711 17.307 1.00 27.19 I? 

10 AICH 874 CP2 HIS A 92 74.699 -6.978 18.106 1.00 21.95 £ 

ATOM 875 CB1 HIS A 92 73.755 -8.487 16.826 1.00 23.66 _C 

Aiai 876 N£2 His A 92 74.918 -9,9 62 17.296 1.99 17.36 H 

AXSM 877 N ALA A 93 74.329 -5.187 21.333 1.00 15.66 N 

Aicn &ZS CA ALA A 93 75.530 -$.391 22.119 1,99 11.88 £ 

15 ATCM 979 c ALA A 93 75.222 -5 T 90Q 23,512 1.00 28.78 £ 

ATOM 88J3 Q ALA A 93 75.912 -6.790 24.037 1.00 25.23 Q 

Aiai 991 CB ALA A 93 76.139 -3.959 22.221 1.00 6.30 £ 

ATCM 982 N ALA A 93 74.142 -5.442 24.113 1.00 18.82 II 

AX£H 883 CA AIA A 94 73.777 -5.971 25.399 1.00 15.61 Q 

20 Aiai 884 C AIA A 94 73.593 -7.503 25.301 1.00 28.39 £ 

ATCM 895 O AIA A 94 74.133 -8.263 26.099 1.00 21.67 Q 

ATOM 886 CP ALA A 94 72.449 -5,279 25 T 9U 1.00 19,46 £ 

AHM 897 N HIS A 95 72.914 -7,956 24.329 1.99 26.35 H 

ATCM 888 CA HIS A 95 72.551 -9.396 24.271 1.00 24.89 £ 

25 Aiaj 889 C HIS A 95 73.845 -10.176 24.140 1.00 22.81 £ 

&X01 899 O HIS A 95 74.077 -11.136 24.865 1.00 21.44 O 

AEM 891 CB HIS A 95 71.571 -9.778 23.129 1.00 22.39 £ 

A1SH 892 CG HIS A 95 71.554 -11.250 22.831 1.00 28.73 C 

ATOM 893 NDl HIS A 95 70.979 -12.182 23.682 1.00 22.83 N 

30 ATCM B?4 CP2 HIS A 95 72-159 -11,964 21,84.5 1,99 25.22 £ 

ATSM 895 CEl HIS A 95 71.171 -13.397 23.196 1.00 22.72 £ 

ATCM 996 HS2 HIS A 95 71.911 -13.296 22.101 1.00 24.80 H 

ATCM 897 N GLK A 96 74.709 -9.658 23.281 1.00 19.97 N 

AXCH &2J CA GLW A 96 75.960 -10.299 22.917 1.00 22.27 C 

35 AISM 89? c GLN A 96 7 6,977 - 1 9,3 53 24.986 1,99 26 .58 £ 

AX2i 999 9 GLN A 96 77,936 -11,993 24,999 1,99 24.17 q 

ATOM 991 CB GLN A 96 76.642 -9.492 21.818 1.00 23.38 C 

AXOi 902 CG GLN A 96 77.043 -10.299 20.596 1.00 61.06 C 

ATOM 903 CD GLN A 96 78.033 -9.557 19.675 1.00 75.83 C 

40 Aiai 904 OE1 GLN A 96 78.999 -8.941 20.131 1.00 56.89 Q 

ATCM 9Q5 NE2 GLN A 96 77.815 -9.668 18.366 1.00100.00 N. 

ATCM 906 N ASN A 97 76,652 -9.5Q9 25,969 1,99 22.15 N 

AXOi 997 CA ASN A 9? 77,537 -9,536 26.298 1,9Q 14,74 q 

ATCM 90B C ASN A 97 76.732 -10.022 27.387 1.00 29.78 C 

45 Aiai 909 O ASN A 97 77.049 -9.762 28.564 1.00 27.09 O 
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ATOM 910 CB ASH A 97 78.241 -8.201 26.462 1.00 12.93 £ 

ATOM 911 CG ASM A 97 79.260 -7.897 25.407 1.00 24.91 £ 

ATOM 912 OD1 ASN A 97 8Q t 331 -8 T 51B 25,375 1 T QP 57 T j7 O 

ATOM 913 ND2 ASN A 97 78.839 -7.135 24.392 1.00 34.88 

5 ATOM 914 N ASP A 98 75.666 -10.732 27.055 1.00 27.98 H 

ATOM 915 CA ASP A 98 74.907 -11.361 28.089 1.00 29.25 £ 

ATOM 916 C ASP A 98 74.400 -10.379 29.164 l.QQ 37,5 3 £ 

ATCM 917 O ASP A 98 74.505 -10.634 30.36 7 1.00 36.42 Q 

ATCM 918 CB ASP A 98 75.791 -12.450 2B.700 1.00 36.37 £ 

10 ATCM 919 CG ASP A 98 7 5.016 -13.712 29.053 1.00 88.62 £ 

ATCM 920 QD1 ASP A 98 73-775 -13,749 28,977 1 T 0Q 82,53 Q 

ATCM 921 OD2 ASP A 98 75.656 -14 .670 29.542 1.00100.00 Q 

ATCM 922 N VAL A 99 73.879 -9.235 28.730 1.00 27.13 N. 

ATCM 923 CA VAL A 99 73.157 -8.351 29.635 1.00 21.57 £ 

15 ATCM 924 C VAL A 99 71.706 -8.868 29.530 1.00 16.15 £ 

ATCM 925 Q VAL A 99 71,159 -9,088 29,122 1-PQ 13,47 Q 

ATCM 926 CB VAL A 99 73.264 -6.900 29.206 1.00 24.18 £ 

ATCM 927 CGI VAL A 99 72,517 -6,015 30,199 l.QQ 14,58 £ 

atcm 928 cg2 val a 99 74,720 -6.515 29.225 l.QQ 3P.1P £ 

20 ATCM 929 H ASM A 100 71.149 -9.262 30.662 1.00 17.39 N. 

ATCM 930 CA ASN A 100 69,05? -? f ?25 30.613 LPO 25,77 £ 

ATOM 931 C ASN A 100 $8,648 -9,034 30,910 l r gQ 24,95 £ 

ATCM 932 O ASN A 100 67.498 -9.377 30.582 1-PP 2P t 88 2 

ATCM 933 CB ASM A 100 69.B46 -11. 157 31,52? 1-00 11,98 £ 

25 ATOM 934 CG ASM A 100 68.724 -12.112 31.180 l.QQ 2Q,38 £ 

ATCM 935 OD1 ASM A 100 68.737 -12.709 30.100 1.00 29.59 O 

ATCM 936 ND2 ASM A 100 67.716 -12.240 32.076 1.00 16.35 N. 

ATCM 937 N LY5 A 101 68.941 -7.923 31.584 1.00 17.91 H 

ATCM 938 CA LYS A 101 67.970 -6. ?16 31 . 99 4 1-PO 25,43 C 

30 ATOM 939 C LYS A 101 68.107 -5.510 31.323 1,PP 25,39 £ 

ATCM 940 O LYS A 101 69.151 -4.850 31.377 1.00 19.88 Q 

ATOM 941 CB LYS A 101 67.996 -6.807 33.52 1 1.00 29.28 £ 

ATCM 942 CG LYS A 101 67.464 -8.054 34.205 1.00 9.31 £ 

ATCM 943 CD LYS A 101 67,218 -7,71? 35,668 1,00 38,93 £ 

35 ATCM 944 CE LYS A 101 66,2Q6 -6,569 35.995 1-pQ 13,39 £ 

ATCM 945 NZ LYS A 101 64,75P -7.PQ6 35,825 1-0P 15-26 N. 

ATOM 946 N LEU A 102 67.013 -5,043 3Q.732 1,00 22-22 H 

ATCM 947 CA LEU A 102 6 7.003 -3.744 30.092 1.00 15.40 £ 

ATCM 948 C LEU A 102 65.612 -3.115 30.156 1.00 18.55 C 

40 ATCM 949 O LEU A 102 64.590 -3 .811 30.102 l.QQ 18.92 O 

ATOM 950 CB LEU A 102 67.465 -3.898 28.636 l.QQ 11.23 £ 

ATCM 951 CG LEU A 102 67.553 -2.711 27.651 l.PQ 15.51 £ 

AX£K 952 CP1 LEV A 1P2 69,628 -2,995 26,559 1,00 9,65 £ 

ATCM 953 CD2 LEU A 102 66.162 -2.407 26.995 1,00 13,10 £ 

45 ATCM 954 M LEU A 103 65.595 -1 .798 30.318 1 , QQ 1? , Q5 N_ 
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AI03 955 CA LEV A 10? 64.356 -1.036 30.265 1.00 16.23 £ 

ATCfl 956 C LEU A 103 64.346 -0.072 29.046 1.00 19.65 £ 

AXSM 957 Q LEV A 193 65,215 0.789 28.875 1.00 19.68 2 

Aiai 959 CB LEV A IP? 64.099 -0.289 31.56 2 1.00 12.28 £ 

5 AI2i 959 £G LEV A 193 62.686 0.259 31.594 1.00 14.13 C 

ATCM 960 CD1 LEU A 103 61.645 -0.822 31.902 1.00 10.31 £ 

Aiai 961 CD2 L EU A 103 62.646 1.360 32.601 1.00 12.30 £ 

MSB 962 H CHE A 191 63.417 -0.333 28.140 1.00 16.41 £ 

ATCM 96? CA PHfi A 191 63,215 0.486 26.956 1.00 18.32 £ 

10 AX£M S6J 2 PHE A 104 62.126 1.546 27.249 1.00 21.85 £ 

ATOM 965 O PHE A 191 61,168 1.271 27.992 1.00 18.36 Q 

ATOM 966 CB PHE A 104 62.796 -0.386 25.793 1.00 9.86 C 

AX£M 967 CG PHE A 104 62.732 0.348 24.508 1.00 16.81 £ 

ATOM 2££ CDl PHE A 104 63.894 0.714 23.B40 1.00 25.04 £ 

15 ATOM 969 CD2 PHE A 104 61.511 0.795 24.005 1.00 22.59 C 

Aiai &7J CE1 PHE A 194 63.836 1.448 22.619 1.00 31.26 £ 

Alfit? 971 CE2 PHE A 191 61.41? 1.535 22.814 1.00 15.59 £ 

Ami 972 CZ PHE A 104 62.625 1.895 22.139 1.00 11.67 £ 

A1CH 973_N LEV A 195 62 r ?1l 2.762 26.734 1.00 20.33 H 

20 971 £A LEV A 195 61.416 3.897 26.9 04 1.00 18.10 £ 

AXSM 975 C LEU A 105 60.711 4.237 25.634 1.00 17.04 £ 

ATCtf 976 P LEU A 105 61.315 4.680 24.665 1.00 18.83 O 

ATCM 977 CB LEU A 105 62.178 5.146 27.214 1.00 17.49 £ 

AXCH 978 CG LEV A 195 62.434 5.544 28.644 1.00 27.17 £ 

25 AISM 979 CPl LEV A 105 62 t 6?Q 4.349 29.574 1 .00 19.16 £ 

AX21 989 CP2 LEV A 195 63.688 6. 347 28.529 1.00 23.59 £ 

ATOM 991 W SLY A 106 59.407 4.153 25.652 1.00 20.66 N 

AICM 232 CA GLY A 106 58 .679 4.536 24.455 1.00 21.03 £ 

AXOJ 98? C GLY A 106 58.080 5.935 24.597 1.00 17.32 £ 

30 AICU 991 9 GLY A 106 58.690 6.858 25.113 1.00 26.89 Q 

AIC« 995 K SSR A 197 56.B31 6.047 24.219 1.00 22.05 H 

AISM 983 CA SEP A 197 56.177 7.317 24.288 1.00 22.12 £ 

Aiai 987 £ SEP A 197 51,696 7.212 23.923 1.00 19.06 £ 

AX£M 9JPJ! 2 SEP A 197 54.314 6. 545 22.963 1.00 27.42 Q 

35 Aiaj 989 CB SER A 107 56.882 8.232 23.300 1.00 20.99 £ 

A1SM 990 OG SER A 107 55.947 9* 133 22.776 1.00 42.85 Q 

ATCM 99 1 N SER A 108 53.826 7.890 24.671 1.00 27.42 N 

AICH 992 CA ffER A 109 52.382 7. 947 24.339 1.00 26.43 £ 

AICH 99? C SER A 199 52.111 8.259 22.B42 1.00 30.97 £ 

40 AISM 994 p SgR A 109 51.242 7.709 22.217 1.00 33.46 Q 

AlfiM 995 CB SER A 198 51.719 9.072 25.144 l.QO 19.R7 £ 

ATOM 996 OG SER A 108 52.495 10.266 25.071 1.00 70.88 O 

AICH 997 N CYS A 199 52,927 9.180 22.278 1.00 24.73 a 

ATQM 996 CA CYS A 109 52.728 9.549 20.880 1.00 25.61 £ 

45 A12H 999 £ CYS A 109 52.970 8.482 19.815 1.00 21.29 £ 
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AI2M. 1945 CPl LEV A 114 49,99} 9.797 13,594 1*99 43.29 £ 

ATCM 1945 CP2 LEV A 114 41,139 19.512 15,399 1,99 25,95 £ 

ATOM 1047 n ALA A 115 43,19? 5,473 18,527 1,99 ??,pp M 

ATCM. 1MB CA ALA A 115 42,929 5 t 445 19.529 1-QQ 25.66 £ 

5 ATOM 1049 C ALA A 115 41 .7 22 5.727 20.454 1.00 28.76 £ 

ATOM 1050 O AIA A 115 41.364 6.855 20.682 1.00 24.12 Q 

ARB 1051 CB ALA A 115 44.177 5.272 20.326 1.00 16.86 £ 

A TPM 1952 N LYS A 116 41.137 4.675 29.999 1.99 39.21 U 

AI34 1953 CA LYS A 116 49,036 4.792 21.929 1.99 25.85 £ 

10 AJjai 1954 C LIS A 115 49.669 5.248 23.195 1.00 14.18 £ 

AXCM 1055 O LYS A 116 4 1,750 4.781 23.535 1.00 23.51 Q_ 

AEM 1056 CB LYS A 116 39.369 3.415 22.115 1.00 22.05 £ 

ATOM 1057 CG LYS A 116 39.053 3.032 23.524 1.00 55.38 £ 

A TPM 105J CD LYS A H6 37,96? 1.955 23.549 1.00100.00 £ 

15 ATOM 1959 CE LYS A 115 37.129 1.953 24.835 1.00100.00 C 

AX01 1060 NZ LYS A 116 35.767 1.310 24.630 1.00100.00 H 

ATOM 1051 H <?Lff A 117 49.921 6,299 23.855 1.99 19.2? H 

ATCM 1M2 CA GLN A 117 40.456 6.757 25.180 1.00 21.01 £ 

ATCM 1063 C GLN A 117 39.695 6.178 26.383 1.00 30.96 £ 

20 AT£M 1Q54 O <?LN A 117 38,493 6.009 26.345 1.00 27.66 O 

ATCM 1065 CB GLN A 117 49,215 8.263 25.179 1.00 11.32 £ 

AXCH 1066 CG GLN A 117 40.849 8.912 23.948 1.00 12.12 £ 

ATCM 1957 CP SLK A U7 42.494 9.823 23.954 1 .99 24.19 £ 

Alas 1068 QB1 gln A U7 43-041 9,529 22,996 j.op 37,99 2 

25 ATOM 1069 NE2 GLN A 117 43.001 8.953 25.131 1.00 14.24 H 

AXCM 1979 N PRO A lis 49,374 5,992, 27.499 1.99 30,9? N 

ATCtf 1071 CA PRO A 118 41,826 6.194 27.655 1.00 26.44 £ 

A TCM 1072 C PRO A 118 42,459 5.050 26.899 1.00 24.37 £ 

ATOM 1073 Q PRO A 118 41.792 4.027 26.726 1.00 25.34 O 

30 ATOM 1074 CB PRO A 118 42.055 5.994 29.167 1.00 23.89 £ 

ATOM. 1075 CG PRO A 118 40.847 5.240 29.654 1.00 23.20 £ 

axai 1975 CP PRO A 118 39.595 5,519 28.799 1,99 15,79 £ 

ATCM 1077 N MET A 119 43.684 5.228 26. 432 1.00 16.00 H 

ATCM 1078 CA MET A 119 44.372 4.215 25.644 1.00 10.80 £ 

35 AlOl 1079 C MET A 119 45.062 3.083 26.444 1.00 23.61 £ 

ATCM 1999 o MET a n? 45,913 3,281 27,299 1-09 18.92 o 

ATCM 1 981 CP M ET A 1 19 45.384 4- 99 4 24 ,79 1 1.99 13.52 £ 

ATCM 1982 Cg MET A 119 44.991 5,914 23.989 1.99 18,52 £ 

ATOM 1993 SP MET A 119 46.157 7.054 23.271 1.00 26.27 g_ 

40 ATCM 1084 CE MET A 119 46.264 6.524 21.845 1.00 33.79 C 

A TOM 1985 N ALA A 129 44.559 1.875 26.271 l.OO 26.64 U 

AXOJ 10B6 CA ALA A 120 45.177 0.712 26.884 1.00 29.17 £ 

AXCM 1997 C ALA A 129 45.356 9,398 25.984 1,99 23.21 £ 

AICM 1988 Q ALA A 129 45.439 9.759 24.833 1,99 29.19 Q 

45 AlCH 1089 CB ALA A 120 44.169 -0.419 26.944 1,99 25.92 £ 
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ATCM 1090 N GLU A 121 47.238 -0.553 26.507 l.QO 12.30 & 

ATOM 1091 CA GLU A 121 48.427 -1.009 25.788 1,09 Z^AS £ 

Aiai 1Q?2 C git? A 121 49.979 -1,697 24,459 1,99 11,68 £ 

ATCM 1093 O GLU A 121 4B.828 -1 .670 23.450 1.00 14.84 Q 

5 ATOM 1094 CB GT.U A 121 4?. 321 -1.983 26.715 1,00 16,74 £ 

ATCM 1095 CG GLU A 121 50.132 -1.122 27.763 1.00 19.14 £ 

ATOM 1096 CD GLU A 121 49.458 -1 .000 29.137 1.00 13.00 £ 

ATOM 1097 OE1 GLU A 121 48.252 -1.294 29.276 l.QO 20.79 Q 

ATCM 1098 OE2 GLU A 121 50.123 -0.521 30.080 1.00 17. B6 Q 

10 atcm 1099 h SEP A 122 16,987 -2,273 24,499 1,99 11,79 n 

&XCH 1199 CA SBR A 122 16.127 -2,977 23.219 1.99 12,16 £ 

ATCM 1101 C SER A 122 46.030 -2.058 22.100 1.00 11.70 C 

ATCM 1102 O SER A 122 45.717 -2.529 21.010 1.00 13.91 Q 

AT CM HO? cp SER A 122 45,186 -3,791 23,569 1.99 21,59 £ 

15 ATOM 1104 OG SER A 122 44.143 -2.908 23.976 1.00 28.52 Q 

ATCM 1105 N GLU A 123 46.041 -0.754 22.341 l f 99 14-65 H 

ATCM 1106 CA GLU A 123 45,783 9.292 21.243 1-09 17,15 C 

atcm 1107 c glu a 123 46.959 9.313 29.249 1,99 11.49 £ 

ATCM 1108 O GLU A 123 46.821 0.844 19.141 1.00 11.19 <2 

20 ATCM 1109 CB GLU A 123 45.481 1.699 21,995 1,99 21.66 £ 

ATCM 1110 CG GLU A 123 44.127 1,694 22.523 1-Q0 24.68 C 

ATCM 1111 CD GLU A 123 42,994 1,374 21-585 l.QO 35.56 £ 

ATCM 1112 OE1 GLU A 123 43.019 1.9 65 29 .426 1,99 4 1 . 7? Q 

atcm 1113 OE2 GLU A 123 42.159 9.497 21,949 1,99199.99 Q 

25 atom 1114 n LEV A 124 49, 13* -9-185 29,619 1,00 14,92 H 

ATCM 1115 CA LEU A 124 49.296 -0.082 19.740 1,99 15,32 £ 

ATCM 1116 C LEU A 124 49.082 -0.754 18.458 1.00 17.76 £ 

AXCH 1117 O LEU A 12 4 4,8.752 -l,9;7 19,445 1-09 19,91 Q 

ATCM 1118 CB LEU A 124 50.564 -0.680 20.362 1.00 18.07 £ 

30 ATCM 1119 CG LEU A 124 51.922 -0.222 19.803 l.QO 21 ,52 £ 

ATCff 1129 gPl LEV A 124 52.080 1,258 20,117 l.QQ 29,35 £ 

atcm 1121 CD2 leu a 124 53.942 -9,919 29,559 1,99 14.97 £ 

ATCM 1122 N LEU A 125 49.514 -0.071 17.409 1.00 18.44 H 

ATCM 1123 CA LEU A 125 49,445 -Q.564 16,952 1,QQ 19,92 £ 

35 ATCM 1124 C LEU A 125 48 .034 -0 .754 15.509 1.00 25.56 C_ 

atcm 1125 o LEV A 125 47,B54 -1,18$ 14.364 l.QO 19. ?6 Q 

ATOM 112 6 CB LEU A 125 50.355 -1.800 15.840 1.00 20.79 C 

ATOM 1127 CG LEU A 125 51.890 -1.511 15.778 1,99 17,23 £ 

ATOM 112 8 CD1 LEU A 125 52.744 -2.649 16.316 1.00 19.95 C 

40 ATOM 1129 CD2 LEU A 125 52.334 -1.219 14.338 1.00 5.B1 C 

ATOM 1130 N GLH A 126 47.027 -0.327 16.276 1.00 21.97 H 

ATOM 1131 CA GLNA 126 45.652 ~9.594 15.790 1,Q9 19.97 £ 

ATCM 1132 C GLN A 126 45.213 0.447 14.724 1,99 ?8 t ?l £ 

ATOM 1133 O GLNA 126 44.076 0.391 14.293 1.00 47.49 Q 

45 ATOM 1134 CB GLN A 126 44.652 -0. 404 16.911 1.00 19.87 C 
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ATOM 1135 CG GLN A 126 44.949 -1.312 19,949 1.00 18.39 C 

AXfiM. 11 3$ CP G LN A 1 26 14 ,319 - 2, 6 26 1 7 ,935. 1,99 66,99 £ 

ATCM. 1137 pel GLN A 126 44,964 -3,376, 19.732 1,99 49,75 Q 

ATCM 1139 NE2 cm A 126 44,915 zl^Z 16,565 1.99 71,74 li 

5 Aiai 1139 N GLY A 127 46.080 1.330 14.270 1.00 28.29 H 

AlOl 1140 CA GLY A 127 45.627 2.260 13.252 1.00 23.31 £ 

ATOM 1141 C GLY A 127 46.662 3.315 12.953 1.00 22.90 £ 

him 1142 9 GLY A 127 47.755 3,254 13,474 1,99 25,39 Q 

ATOM 1143 P THR A 128 46,311 4,219 12,916 1,90 19,51, m 

10 Aiai 1144 CA THR A 12? 47.149 5,314 11,599 1.99 22,12 £ 

Aiai 1145 C THR A 128 47.705 6.219 12.695 1.00 22.60 C 

AJCM. 1146 O THR A 128 47.061 6.461 13.731 1.00 1B.5B O 

ATCM 1147 CB THR A 128 46.392 6-192 10.544 1.00 35.98 £ 

ATCM llifi OG1 THR A 128 46.533 5.594 9.239 1.00 58.05 Q 

15 atom 1149 CG2 THR A 128 16.942 7,639 19.542 1,99 43.41 £ 

AXOJ 1150 N LEV A 129 49,907 6,715 12,425 1-00 18.32 U 

AISM 1151 CA LBV A 129 49,674 7.531 13.356 1.09 16.76 £ 

Aim 1152 £ LEV A 129 49.504 9,959 12.967 1.00 4.89 £ 

A TOM 1153 O LEU A 129 49.232 9.260 11.814 1.00 16.14 Q 

20 Aim 1154 CB LEV A 129 51.205 7.191 13.261 1.00 17.91 £ 

ATCM 1155 CG LEV A 129 51.769 5.804 13.752 1-00 18.21 C 

ATOM 115 6 CD1 LEU A 129 53.132 5.379 13.193 1.00 12.12 £ 

ATOM 1157 CP2 LBV A 129 51,683 5.532 15.251 1,99 3.99 £ 

AXSM 1159 W GLV A 139 19,916 9.927 13.917 1,00 19,23 H 

25 AXSM 1159 CA GLV A 139 19.912 11,269 13.691 1,99 13,22 £ 

Aiai 1169 C gfcv A 139 51,129 11,544 12,775 1,99 23.44 £ 

ATOM 1161 O GLU A 130 52.249 11.162 13.090 1.00 21.23 O 

ATOM 1162 CB GLU A 130 50.150 11.979 15.035 1.00 18.48 C 

Aiai 1163 CG GLU A 130 50.754 13.376 14.886 1.00 77.44 £ 

30 atcm 1164 CP glu a 139 49-833 14,328 14,121 1,09199,99 £ 

AX21 1165 PEl ffLV A 1?Q 48,588 14,205 14,349 l-QQ 36,19 o 

AXSH 1166 Qfi2 gLV A 139 59.347 15.161 13.295 1,90 21.93 Q 

AICH 1167 W PRO A 131 50.920 12.219 11.648 1.00 21.35 £ 

Aiaf 1168 CA PR O A 131 52.023 1 2.409 10.731 1.00 14.78 C 

35 ATOM 1169 C PRO A 131 53.201 13.132 1 1.265 1.00 14.98 £ 

ATCM 1170 O PRO A 131 54.325 12.847 10.853 1.00 20.99 Q 

AICB 1171 Cg PRC* A 131 51,413 13-154 9,552 1,99 14,76 £ 

ATCM 1172 CG PRO A 131 59.971 13.485 9.949 1,99 29.99 £ 

AIOl 1173 CP PRO A 131 19.641 12.626 11.047 1.00 17.25 £ 

40 AICU 1174 N THR A 132 52.986 14.095 12.159 1.00 18.77 K 

ATOM 1175 CA THR A 132 54,131 11,939 12,699 1.00 16,44 £ 

ATOM. 1176 C THR A 132 55.102 13.951 13.40B 1.00 21.91 £ 

AlflK 1177 0 THR A 132 56,317 14,988 13,234 1,99 24,17 Q 

Aiai 1179 CB THR A 132 53,716 15.997 13.696 1,99 23,15 £ 

45 ATCU 1179 ecu thr a 132 52.976 16,893 12-950 i.oo 31,15 Q 



WO 99/646 1 8 PCT/US99/1 1 576 

108 



10 



15 



20 



25 



30 



35 



40 



45 



ATCM 


1199 


CG2 


THR A 


132 


54.969 


16.519 


14,341 


1.00 


9.28 


c 


ATOM 


1191 


N 


ASN A 


133 


54.551 


12.970 


14,122 


1.00 


28.59 


N 


ATCM 


1182 


CA_ 


ASN A 


133 


55,35? 


12.997 


14,975 


1,99 


26,39 


C 


ATCM 


1183 


c 


ASN A 


133 


55, §66. 


19. §92 


14.297 


1,99 


14,95 


c 


ATCM 


1184 


Q 


ASN A 


133 


56.446 


9.884 


14.755 


1.00 


18.67 


0 


ATCM 


1185 


Cfi 


ASN A 


133 


54, §61 


11.699 


16.168 


1.00 23.70 


c 


ATCM 


1186 


CG 


ASN A 


13? 


54.480 


12.B94 


16.969 


1.00 




C 


ATCM 


1197 


OD1 ASN A 


133 


53,354 


13.272 


17.252 


1,99 


49,97 


o 


ATCM 


1188 


NP2 


ASN A 


133 


55,569 


13,639 


17.163 


1,99 


40,36 


N 


ATCM 


1189 


N 


GZ.V A 


134 


55.100 


10.469 


13.922 


1.99 


9,99 


N 


ATCM 


11?Q 


CA 


GLU A 


134 


55.237 


9.210 


12.365 


1.00 


9.66 


c 


ATCM 


1191 


c 


GLU A 


134 


56.648 


8.530 


12.274 


1.00 


13.86 


C 


ATCM 


1192 


o 


GLU A 


134 


56.814 


7.388 


12.706 


1.00 


22,99 


o 


ATCM 


ll?3 


CP 


GLU A 


134 


54.44B 


9.200 


11.070 


1.00 


17,55 


c 


ATCM 


1194 


CG 


<5LV A 134 


54.750 


7.939 


19.227 


1,90 


20. B9 


c 


ATOM 


1195 


CD 


GLU A 134 


53,?26 


7.868 


8.970 


1 .00 


13.59 


c 


ATCM 


1196 


961 


GLV A 


134 


52.679 


7.738 


9.085 


1 . 00 


35,28 


9 


ATOM 


11?7 


9E2 


SLV A 


134 


54,4?7 


9,949 


7.969 


1.00 


13,44 


9 


ATCM 


1198 


N 


PR9 A 


135 


57.680 


9.222 


11.789 


1.00 


15.72 


N 


ATCM 


1199 


CA 


PRO A 


135 


59.014 


8.600 


11.699 


1.00 


18.91 


c 


ATCM 


1200 


c 


PRO A 


135 


59.544 


B.174 


13.073 


1.00 


18.68 


c 


ATCM 


1201 


_2 


PR9 A 


135 


60.072 


7.069 


13.271 


1 - uu 




o 


ATCM 


1202 




PRO A 


135 


59.896 


9,755 


11.16? 


1 nn 

1 - uu 


1 "3 OA 

13,94 . . ._„ 


c 


ATCM 


1203 




PRO A 


135 


5?, 936 


19,514 


19,359 


i nn 
1 . UU 




c 


ATCM 


1294 




PRQ A 


135 


57,594 


19,395 


19,998 


i nn 


1 A il 


c 


ATCM 


1205 


VT 


TXR A 


136 


59.44? 


9,117 


13.994 


i nn 




N 


ATCM 


1296 


,.£A 


TYR A 


136 


59.873 


8.915 


15.324 


i nn 


1 1 01 


c 


ATOM 


1207 


c 


m a 


136 


59,056 


7.728 


15.907 


i nn 
1 . UU 


lt>. OS 


c 


ATCM 


1208 


O _ 


TYR A 


136 


59.578 


6.903 


16.658 


i nn 


i •> on 


o 


ATCM 


120? 




TYR A 


136 


59.604 


10.234 


16.199 


i nn 
1 . UU 


I 5 ,5,1 


c 


ATCM 


1219 


CG 


TYR A 


136 


59.912 


19,169 


17.614 


i nn 


J- %i 4° 


c 


ATCM 


1211 


CDl 


TYR A 


136 


61.200 


19.962 


19.972 


i nn 


9n 


c 


ATCM 


1212 


CD 2 


TYR A 


136 


59.994 


19,159 


19,569 


1. 00 


17.38 


C 


ATCM 


1213 


CE1 


TYR A 


136 


61,494 


9.959 


19.440 


1.00 


30.44 


c 


ATCM 


1214 


CE2 


_TYR A 136 _ 


59.184 


10.084 


19.953 


1.99 


9.85 


C 


ATCM 


1215 


CS 


TYR A 


136 


60.476 


9.949 


20.377 


1,99 


20.65 


c 


ATCM 


121$ 


9H 


TYR A 


136 


69.792 


9,973 


21,734 


1.99 


24.41 


0 


ATCM 


1217 


N 


ALA A 132 


57,7§p 


7.687 


15.639 


1,09 


7.19 


N 


ATCM 


1218 


CA 


ALA A 


137 


56.923 


6,633 


16,227 


1.00 


12,69 


c 


ATCM 


1219 


c 


ALA A 


137 


57.345 


5.265 


15.737 


1-90 


15.21 


c 


ATCM 


1220 


o 


ALA A 


137 


57.425 


4.272 


16.488 


1-90 


14.58 


o 


ATCM 


1221 


CP „ 




137 


55.517 


6.849 


15.871 


1.00 


11.40 


c 


ATCM 


1222 


Ji — 




139 


57.5§7 


5,213 


14.447 


1,99 


9,93 


N 


ATCM 


1223 


GA_ 


IhZ A 


139 


57,954 


3.971 


13.931 


1,99 


11,77 


C 


ATCM 


1224 


c 


ILfi A 


138 


59,246 


3,494 


14.492 


1,99 


16.29 


C 



WO 99/64618 



PCT/US99/11576 



109 



1225 o ile a 138 59.3Q7 2,377 14.979 1,99 13,79 2 

AISH 122$ CB ILE A 139 59,9 64 1,172 12.315 1,99 17,95 £ 

ATCM. 12 27 CC 1 ILE A 138 56.699 4.473 11.757 1,00 29.21 £ 

AlfiM 1229 C<?2 ILE A 139 59,6?4 2.996 11.692 1,00 9.91 £ 

5 hX£& 1229 CP1 ILE A 139 55,695 3.37$ 11.970 1,90 19,17 £ 

ATCM 1230 W AIA A 139 60.243 4.361 14.625 1.00 11.54 N 

ATCM 1231 CA AIA A 139 61,194 3,937 15.289 1.09 13,22 £ 

atcm 1232 c AiA a 139 61.25$ 3.364 16.675 1,90 19,73 £ 

atcm 1233 o aia a 139 61,791 2,319 17,031 1.00 29,44 2 

10 AXai 1234 CB ALA A 139 62,434 5,973 15,399 1,99 13.62 £ 

Aiai 1235 N LYS A 149 69,397 4,033 17,449 1.99 16.36 U 

ATCM 1236 CA L YS A 140 60.083 3.60Q 18.815 1.00 15.14 £ 

ATCM 1237 C LYS A 140 59.392 2.262 IB. 824 1.00 15.18 C 

ATCM 1238 O LYS A 140 59.924 1.346 19.475 1.00 21.42 2 

15 Aiaj 1239 CB LYS A 149 59,193 4,69$ 19,525 1,90 17.86 £ 

ATCM 1240 C<? LYS A 149 59.925 5.806 20.152 1,99 21,11 £ 

ATC M 1241 CP LYS A 149 61.298 5.478 20.958 1.09 16,75 £ 

AXOi 1242 CE LYS A 149 61.664 6.735 21.835 1,09 19.06 £ 

ATCM 1243 wz lys A 140 62.689 6.496 22.921 1.00 14,49 H 

20 ATOM 1244 N ILE A 141 58.356 2.116 18.027 1.00 11.49 H 

ATCM 1245 CA ILE A 141 57.703 0.828 17. 977 1.00 17.92 £ 

ATCM 1246 C ILE A 141 58.729 -0.282 17.577 1-00 13.46 £ 

AXJ21 1247 O TLB A 141 58.730 -1.374 18.148 1.00 13.92 Q 

ATCM 1248 CB ILE A 141 56,497 9,925 17.019 i,qq 22.59 £ 

25 atcm. 1249 cg-1 ile A 141 55.466 l T 9Q6 17,557 1,00 17,61 £ 

ATCM 1250 CG2 ILE A 141 55.863 -0.411 16.700 1.00 10.49 £ 

ATCM 1251 CD1 ILE A 141 54.530 2,327 16,449 1,99 13.43 £ 

ATCM 1252 N ALA A 142 59.637 9,9 29 16.650 1.00 10.29 H 

ATCU 1253 CA ALA A 142 69. $57 -0.931 16,229 l-QO 7,15 £ 

30 ATCM 1254 C ALA A 142 61.456 -1.301 17.456 1.00 16,58 £ 

atcu 1255 o ALA A 142 61,839 -2,454 17,621 1.99 13,94 Q 

ATCM 1256 CB ALA A 14? 61,694 -9,299 15,139 1-09 4,44 £ 

ATOM 125 7 N GLY A 143 61.703 -0.307 18.316 1.00 9.56 Jf 

ATOM. 1258 CA GLY A 143 62.448 -0.525 19.527 1.00 5.15 £ 

35 ATCM 1259 c gly A 143 61,779 -1.555 29.439 1,00 16,36 £ 

ATOM 1260 O GLY A 143 62.392 -2.482 20.967 1.00 14.11 Q 

ATCM 1261 N ILE A 144 69,476 -1,418 29,564 1,90 20,33 U 

atcm 1262 CA ILE A 144 59.725 -2,314 21,497 l f 90 15-35 £ 

ATCM 1263 C ILE A 144 59.796 -3.732 29.959 1,99 19,94 £ 

40 atcm 1264 o ILE A 141 59.93$ -4.799 21.698 1.99 17.93 S 

ATCM. 1265 CB IL E A 144 58.317 -1.819 21.559 1.00 10.60 C 

ATCM 1266 CGI ILE A 144 58.311 -0.610 22.516 1.00 9.80 £ 

ATCM 1267 CG2 ILE A 144 57.410 -2.928 22.122 1.00 9.60 £ 

ATCM 1268 CD1 ILE A 144 57.022 9.97$ 22,517 1.99 18.32 £ 

45 ATCM 1269 K LYS A 145 59.520 -3.841 19.556 1.00 7.20 (J 
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ATCM 1270 CA LY5 A 145 59,459 -9.13? 19.926 1,90 7,64 £ 

ATOM 1271 c LIS A 115 69,949 -5,799 18.931 1,99 15,32 £ 

atom 1272 o LYS A 145 69.923 -6,999 19,991 1.99 14,76 q 

ATOM 1273 CB LYS A 145 58.891 -5.001 17.516 1.00 11.25 £ 

5 ATOM 1274 CG LYS A 145 57.414 -4.581 17.489 1.00 12.13 C 

ATOM 1275 CD LYS A 145 56.642 -5 .434 18.495 1.00 25.23 £ 

ATOM 1276 eg LYS A 145 55.199 -4,995 19.692 1,00 13 T 64 £ 

ATOM 1 277 WZ LYS a 11 5 54 ,44 1 -6 rlll 19.392 l.QQ 11.94 M 

ATOM 1278 N LEU A 146 61.934 -5.011 18.9B6 1.00 26.98 X 

10 ATOM 1279 CA LEU A 146 63.261 ~5.642 19.167 1.00 19.72 £ 

ATOM. 1280 C L EU A 146 63.262 -6.316 20.542 1.00 18.20 £ 

Aiaa 1281 O LEU A 146 63.590 -7.511 20.703 1.00 19.86 Q 

ATOM 1282 CB LEU A 146 64.398 -4.618 19.150 1.00 13.56 £ 

ATOM 12 93 cg LE V A 1 46 64 .9 95 -4, 259 17.759 l.oo 21.84 £ 

15 ATOM 1294 CP1 LEV A 146 65.672 -2.945 17,917 IrQQ 17.94 £ 

Aiaj 1285 £P2 LEV A 146 65,745 -5,397 17.192 1,QP 16.19 — c 

ATOM 1286 W CY5 A 147 62.931 -5.523 21.548 1.00 7.91 N 

ATOM 1287 CA CYS A 147 62.875 -6.064 22.893 1.00 9.14 £ 

ATOM 1288 C CYS A 147 62.072 -7.378 22.945 1.00 22.72 £ 

20 ATOM 1299 9 CYS ft 147 62.568 -8.401 23.383 1.00 16.90 O 

ATOM 1290 CB CYS A 147 62.232 -5.058 23.809 1.00 12.63 £ 

ATOM 1291 sg CYS A 147 63.411 - 3. 923 24.3 16 1,99 1 5,9 2 £ 

ATOM 1292 P GLV A 149 69,923 -7,352 22,509 1,99 29,93 n 

ATOM 1293 CA SLV A 149 69.916 -9,555 22,567 1-99 16,99 £ 

25 AnM 1294 C glu a 148 69,695 -9,715 21,992 1.99 22.61 £ 

ATCU 1295 O GLU ft 148 60.651 -10.888 22.226 1.00 12.05 Qj 

ATOM. 1296 CB GLU A 148 58.597 -8.268 22.046 1.00 14.66 £ 

ATOM 1297 CG GLU A 148 57.864 -7.189 22.84 0 1.00 11.45 C 

ATOM 1298 CD GLU A 148 56.471 -6.821 22.277 1.00 11.75 £ 

30 AT£M 1299 OE1 GLU A 148 56.117 -7.055 21.080 1.00 11.65 O 

ATOM 1300 OE2 GLU A 148 55.728 -6.231 23.081 1.00 22.56 O 

ATOM 1391 W SER A 119 61-368 -9,377 29,715 1-09 15,57 n 

ATOM 1302 CA SER A 149 61.938 -10.428 19.887 1.00 10.21 £ 

ATOM 1303 C SER A 149 63.040 -11.245 20.502 1-00 15.83 C 

35 ATOM 1304 O SER A 149 63.102 12.458 20.291 1.00 12.72 Q 

AXa* 1305 cp SER A 149 62.270 -9,936 19.499 1.90 9,44 £ 

ATOM 1396 W sbr A 149 61,953 -9.659 17,782 1.99 15,91 Q 

ATOM 1307 n tyr a 150 63.910 -19 T 546 21,224 l-QQ *8 t 44 n 

ATOM 1308 CA TYR A 150 65.065 -11.100 21.948 1.00 20.50 £ 

40 AIQH 1399 £ TYR A 150 64,514 -11.949 23.159 1,99 21.97 £ 

ATOM 1310 O TYR A 150 64.939 -12.949 23.486 1.00 31.39 Q 

AXCU 1311 CB TYR A 150 66.005 -9.950 22.425 1,99 13, 71 £ 

ATOM 1312 CG TYR A 150 66.994 -9.509 21.365 1.00 14.13 £ 

ATOM 1313 CD1 TYR A 150 66.611 -8.673 20.317 1,99 14,64 £ 

45 AJCU 1314 CD2 TYR A 150 68.288 -10.000 21.360 L99 19,32 £ 
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atom 1315 cei tyr a 150 67.487 -8.390 19,279 l.PP 11.91 £ 

atom 1316 ck3 tyr a 150 69,198 -9.692 20.345 j.oo ugo C 

ATOM 1317 C2 TYR A 150 6B.B04 -8.900 19.326 1.00 20,95 £ 

ATOM 1318 OH TYR A ISO 69.739 -8.685 18.333 1.00 27.73 Q 

5 ATOM 1319 N ASH A 151 63-536-11.249 23.B01 1.00 14.83 H 

ATOM 1320 CA ASH A 151 62.903 -11.889 24.937 1,00 23.62 £ 

ATOM 1321 C ASH A 151 62.417 -13. 244 24.410 l t QQ 28.53 £ 

ATOM 1322 O ASH A 151 62,630 -14.218 25,072 1,00 25,8? Q 

ATOM 1323 CB ASH A 151 fil . 655 -H.U3 25,139 1-PP 29.95 £ 

10 atom 132 4 cg ash a 151 61,988 -9,867 26.294 1-QP 15,07 £ 

atom 1325 oni ASH A 151 61.126 -9.020 26.466 1.00 26.72 O 

ATOM 1326 HD2 ASH A 151 63. 2 31 "9.799 26.7QP 1 ,QQ 6.31 H 

ATOM 1327 N ARG A 152 61.731 -13.249 23.259 1,00 19.91 H 

ATOM 132 B CA ARG A 152 61.129 -14.465 22.687 l t <?P 17.62 £ 

15 ATOM 1329 C ARG A 152 62.O90 -15.523 22.188 1.00 21.34 £ 

ATOM 1330 O ARG A 152 61.959 -16.687 22.542 l.QQ 15.44 Q 

ATOM 1331 CB ARG A 152 60.086 -14.148 21.61Q l.QQ 15,30 C 

ATOM 1332 CG ARG A 152 5B.672 -13.754 22.157 l.QQ 17.22 £ 

ATOM 1333 cn ARG A 152 57.652 -13.297 21.049 l.QQ 9,11 £ 

20 ATOM 1334 HE ARG A 152 57.161 -14.419 20.241 1.00 21.05 H 

ATOM 1335 CZ ARG A 152 57.159 -14.447 18.912 l.QQ 28.61 £ 

atom 1336 wtti arg a 152 57.590 -13.387 18.221 l.QQ 21.98 M 

ATOM 1337 NH2 ARG A 152 $6,7 1 7 -15,528 18,262 l.PP 26,11 U 

ATOM 1338 W GDI A 153 63.098 -15.104 21.434 l.QQ 16,54 M 

25 ATOM 1339 CA GLN A 153 61. PH -16. Q36, 2Q.842 l.PP 9,74 £ 

ATOM 1340 C GLN A 153 6 5.082 -16.443 21.807 1.00 16. 7Q £ 

ATOM 1341 O GLN A 153 65.529 -17.545 21.763 l.QQ 24.35 Q 

ATOM 1342 CB GLN A 153 6 4.789 -15.372 19.714 1.00 8,99 C 

ATOM 1343 CG GLH A 153 65.935 -16.225 19,116 1.00 4-6? £ 

30 ATOM 1344 rn gln A 153 66.315 -15.637 17.762 l.QQ 14,17 £ 

ATOM 1345 OE1 GLH A 153 65.611 -14.763 17.254 l.QQ 12-53 Q 

ATOM 1346 HE2 GLN A 153 67.466 -16.024 17.228 l.PP 13.38 H 

ATOM 1347 N TYR A 154 6 5.566 -15.518 22.608 l.QQ 14,35 li 

ATOM 1348 ca TYR A 154 66.677 -15.839 23.483 l.QQ 12.16 £ 

35 atom 1349 c tyr a 154 66,323 -15.93Q 24.954 l.PP 19-P6 £ 

ATOM 1350 o TYR A 154 67.185 -16.207 25.777 l.QQ 25,59 2 

atom 1351 cb tyr a 154 67.829 -14.816 23.326 l.QQ 16.89 £ 

ATOM 1352 CG TYR A 1 54 68.418 -14.733 21,943 l.QQ 17,53 C 

atom 1353 cm tyr a 154 69.259 "15, 726 21.467 l.PP 18.91 £ 

40 ATOM 1354 CD2 TYR A 154 68-080 -13.712 21.091 1.00 13,97 £ 

ATOM 1355 CF.1 TYR A 154 69.782 -15.686 20.190 l.QQ 10.98 £ 

ATOM 1356 CE2 TYR A 15 4 68.621 -13.639 19.806 l.QQ 23. 91 £ 

ATOM 1357 CZ TYR A 154 69.488 -14.634 19,380 1,90 23,99 £ 

ATOM 1358 OH TYR A 154 7P-QQ2 -14.51 1 * 19*119 1 • PP 23.87 Q 

45 atom 1359 w gly a 155 65.089 -15,696 25.313 l.QQ 12, P9 S 
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AlOi 13 69 OA GLY A 155 64.747 -15,702 26,731 1,09 15. 9P £ 

ATCM 1361 C GLY A 155 65.323-14.498 27.580 1.00 33.97 C 

ATOM 13 62 O GLY A 155 65,4?I 14.64Q 29,78? l.QQ 25,76 Q 

ATCM 1363 N ARG A 156 65, $64 -13.319 26.991 1,PQ 25,91 H 

5 AlfiM 1364 CA ARG A 156 66.966 -12.146 27,734 l.QQ 14,13 £ 

ATCM 1365 C ARG A 156 64.971 -11.4B6 2B.5B1 1.00 16.23 £ 

ATOi 1356 Q ARg a 156 63.9Q2 -11.919 29, 593 1.99 22. 61 Q 

atom 1367 cb arg a 156 66.691 -11. 124 26.750 l.QP 13.16 £ 

ATCM 1369 Cg ARg A 156 67.975 -11. 570 26.999 l.QP 15.19 £ 

10 AEM 1369 cd arg a 156 69.939 -11.419 27.121 1,99 26t42 £ 

AEM 1370 ne arg a 156 79,299 -11.912 26,633 1.99 21, 25 M 

ATCM 1371 CZ ARG A 156 71.092 -12. 555 27.386 1.00 42.25 £ 

ATCM 1372 NH1 ARG A 156 70.870 -12.795 28.679 1.00 20.02 £1 

AEM 1373 KK2 ARG a 156 72.221 -12,966 26.943 i t op 20,88 m 

15 AICM 1374 W ASP A 157 65,343 -19.446 29.321 l.op 16,99 H 

ATOM 1375 CA ASP A 157 64.370 -9.749 30.166 1.00 16.20 £ 

ATCM 1376 C asp A 157 64,444 -9.245 29.941 1.99 19.29 £ 

ATOM 1377 O ASP A 157 64.865 -7.429 30.650 1.00 10.71 2 

ATCM 1378 CB ASP A 157 64.609 -10.061 31.652 1.00 16.50 £ 

20 ATOM 1379 CG ASP A 157 63.489 -9. 560 32.566 1.00 26.45 £ 

ATCM 1380 ODi asp A 157 62,433 -9.069 32.199 1.99 26,92 S 

AXOJ 1391 QP2 ASP A 157 63,673 -9,653 33,794 1,99 21,99 Q 

ATOM 1382 W TYR A 158 64.038 -7.921 28.620 1.00 19.41 U 

ATOM 13B? CA TYR A 159 64.099 -6.564 28.Q83 l T 0Q 18 T ?6 £ 

25 atcm 1384 c tyr a 158 62,699 -5.977 29,127 1.99 22,62 £ 

ATOM 1385 O TYR A 158 61,854 -6,296 27,292 1-QP 19,12 Q 

AXOJ 1386 cb tyr a 158 64,562 -6.661 26.631 1,9P 16.34 £ 

atom 1387 cg tyr a 158 65,992 -7.166 26.494 1.99 12.94 £ 

ATOM 1388 CD1 TYR A 158 66.789 -7.415 27.621 1.00 13.76 C 

30 ATOM 1389 CD2 TYR A 158 66.544 -7.349 25.218 1-99 16,35 £ 

ATCM 1390 CE1 TYR A 158 68.135 -7.786 27.482 1.00 8.18 £ 

ATCM I3?i cfi2 tyr a 159 67.996 -7.732 25.969 1.Q9 13-73 £ 

AX&H 1392 cz tyr a 158 69.676 -7,942 26.196 1,99 24 T 45 £ 

ATCM 1393 OH TYR A 158 69.993 -8.338 25.997 1.00 14.36 Q 

35 ATCM 1394 M ARG A 159 62,423 -5,299 29,175 1,QQ 23.53 H 

ATCM 1395 CA ARG A 159 61.105 -4.603 29.483 1.00 21.15 C 

ATCM 1396 C ARG A 159 60.930 -3.172 28.87B 1.99 23.55 £ 

ATOM 1397 Q ARG A 159 6l,9H -g t 566 29,424 l-QQ 19,12 Q 

ATCM 1398 CB ARG A 159 60.891 -4.608 31.034 1.00 21.68 £ 

40 ATOM 1399 CG ARG A 159 60.986 -6.029 31.722 1.00 16.41 £ 

AXai 1400 cd arg a 159 61.135 -6,952 33,233 l.QP 19,10 £ 

ATOM 1401 NE ARG A 159 61.305 -7. 402 33.772 1.00 19.25 N. 

atcm 1402 CZ ARG A 159 61.164 -7.720 35.058 1,QQ 36.67 £ 

ATOM 1403 MH1 ARG A 159 60.886 -6.776 35.962 1,P0 15.32 N 

45 ATOM 1404 NH2 ARG A 159 61.309 -8.986 35.448 l.QQ 11,7? U 
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ATOM 1405 H SERA 160 59.689 -2.661 28.859 1.00 24,44 U 

ATOM 1406 CX SERA 160 59.312 -1.393 28.200 1 1 99 21,5? £ 

ATOM 1407 C SER A 160 58.242 -0.577 28.950 1,09 25. Q7 £ 

ATOM 1408 O SERA lfiO 57.257 -1.127 29.454 1.00 17.02 Q 

5 atom 1409 cb SERA lfiO 58.719 -1.747 26,797 l.QQ 13,05 £ 

ATCM 1410 Qfi ser A 160 59.782 -1.897 25.895 l.QQ 37.57 Q 

ATCM 1411 H VALA 161 58.378 Q.74 .2 28.927 l t 0Q 21,91 H 

ATCM 1412 CA VAL A 161 57.369 29,599 lt99 9,7Q £ 

atom 1413 r. val a 161 57.068 2-747 26.594 1.99 16.77 £ 

10 ATOM 1414 o VAL A 161 57.955 3.149 27.729 1.99 16,33 2 

ATCM 1415 CB VAL A 161 57.806 2,249 30.862 1,09 17,94 £ 

ATOM 1416 CGI VAL A 161 57.873 1.195 31.994 1.99 15.16 £ 

ATCM 1417 CG2 VAL A 1 61 59.137 2.992 3Q.750 1.99 21,19 £ 

ATOM 1418 N MET A 162 55.794 3,147 29,443 1-90 2?. 46 E 

15 ATOM 1419 CA MET A 162 55.296 4.195 27,513 1.99 19.23 £ 

ATCM 1420 C MET A 162 54.880 5.312 29,397 1,99 25,19 C 

ATOM 1421 O MET A 162 53.788 5.269 29.961 1,99 19,35 2 

ATOM 1422 CB MET A 162 53.979 3 T 796 26.959 1.99 15.55 C 

ATCM 1423 CG MET A 162 54.013 2,639 25.949 1,99 37.79 £ 

20 ATOM 1424 SD MET A 162 54.354 3,199 24.235 1,99 52,97 £ 

ATCM 1425 CF. MET A 162 56.193 3,134 24.410 1 T QQ 36,39 £ 

atcm 1426 w pro a 163 55.739 6.313 29.521 1,99 19.43 U 

atcm 1427 ca pro a 163 55.399 7 . 47 2 29.337 1,99 17.76 £ 

ATOM 1428 C PRO A 163 54.300 8 .384 28.667 1.00 21.23 £ 

25 atom 1429 o pro a 163 54.208 9.449 27.433 — 1,90 15.20 Q 

ATOM 1430 CB PRO A 163 56.727 9.196 29.423 1,PQ 11,43 £ 

ATOM 1431 CG PRO A 1 63 57.352 7,974 29,931 1,99 13, 9? £ 

atom 1432 cn pro a 163 57.996 6,491 2 7,949 1,99 12.24 £ 

ATOM 1433 N THR A 164 53.47B 9.060 29.479 1-90 13,95 H 

30 atom 1434 ca thr a 164 5?,, 591 19.121 29,963 1.99 25,82 £ 

ATOM 1435 c THR A 164 53.406 11.441 28.781 1,00 19.67 C 

ATOM 1436 Q THR A 164 54.633 11.393 28.868 1.09 13.97 2 

ATOM 1437 CB THR A 164 51.373 10.391 29.993 1.99 25.51 £ 

ATOM 1438 OG1 THR A 164 50.470 11.321 29.267 1.00 14,77 Q 

35 ATOM 1439 CG2 THR A 164 51.819 19,986 31.298 1,99 9,96 £ 

ATOM 1440 W ASN A 165 52-751 12,589 28,556 1,99 14-99 S 

ATOM 1441 CA ASM A 165 5 3.448 13.901 28.491 1.09 7.93 £ 

atom 1442 c ash a 165 54,167 14.064 29.924 1,99 11,21 £ 

ATOM 1443 O ASN A 165 5 3.554 13.929 30.894 1,09 17,66 Q 

40 ATOM 1444 CB ASH A 165 52.434 15.061 28.416 l.QQ 14,48 £ 

ATOM 1445 CG ASH A 165 51.492 14.941 27.262 1,00 23.79 £ 

ATOM 1446 OD1 ASH A 165 51.939 14.800 26.129 1.00 22,37 2 

ATOM 1447 ND2 ASN A 165 50.173 14 -925 27.539 1.00 27.22 N 

ATCM 1448 H LEO A 166 55.418 14.490 29.77? 1.00 8-23 — 

45 ATOM 1449 LRU A 166 56.187 14.604 30.994 1.00 14.40 £ 
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bXSM 14*9 g LEV A 16$ 56,629 1 6 ,9 17 3 1.129 1.00 25.05 £ 

ATOM 1151 p LEV A 166 56,634 16.718 30.125 1.00 25.09 2 

AX21 1452 CP LSV A 166 57,459 13.71? 39.979 1-QQ 17.48 c 

Aiai 1453 CG LEU A 166 57.423 12.218 30.652 1.00 16.63 £ 

5 ATOM 1454 CD1 LEO A 166 58.837 11.639 31.000 1.00 22.52 C 

ATOM 1455 CD2 LEU A 166 56.336 11.539 31.514 1.00 7.46 £ 

ATOM 145$ H HR A 167 57.146 16.391 32.300 1.00 19.78 H 

AX£M 1457 CA TTfR A 167 57.678 17.760 32.511 1.00 18.58 £ 

A1SM 1158 £ TYR A 167 58.534 17.763 33.767 1.00 15.53 C 

10 AXCU 1453 Q TYR A 167 58.474 16.852 34.575 1.00 16.71 O 

ATOM. 1460 CB TYR A 167 56.509 18.778 32.665 1.00 18.33 C 

ATCM 1161 CG TYR A 167 55.671 18.561 33.93 1 1.00 14.23 £ 

ATOM 1462 CD1 TYR A 167 54.624 17.618 33.977 1.00 13.35 C 

ATOM 1463 CP2 TYR A 167 55.984 19.258 35.106 1.00 16.52 £ 

15 ATOM. 1161 CE1 TYR A 167 53.889 17.446 35.146 1.00 21.17 £ 

ATOM 1465 CE2 TYR A 167 55.302 19.084 36. 264 1.00 8.26 £ 

ATCM 1466 CZ TYR A 167 54.228 18.203 36.296 1.00 23.56 C 

ATCM 1467 OH TYR A 167 53.526 18.078 37.504 1.00 22.81 O 

ATCM 1168 N SLY A 169 53.334 18.797 33.952 i.oo 16. sq M 

20 AICH 14 69 CA GLY A 168 60.158 18.817 35.152 1.00 18.21 £ 

AISM 117Q c gLY A 169 6lt5?1 19,128 34.999 l.QQ 13.69 c 

AI£M 1171 P GLY A 168 61.746 20.028 33.837 1.00 16.52 £ 

ATCW 1172 N FRQ A 169 62.173 19.263 35.817 1.00 20.33 iJ 

AX£M L4J3 CA PRP A 169 63,891 19.822 35.656 1.00 16.07 £ 

25 AXSH 1174 g PRO A 169 64.430 19.353 34.387 1.00 27.18 £ 

ATOM 1475 0 PRO A 169 64.305 18.186 33.981 1.00 21.23 O 

ATCM 1476 CB PRO A 169 64.595 19.206 36.805 1.00 17.28 £ 

ATCM 1477 CG PRO A 169 63.649 18.919 37.830 1.00 19.89 £ 

ATOM 1478 CD PRO A 169 62.263 18.772 37.189 1.00 22.47 C 

30 aXOU 1179 W HIS A 17Q 65.226 20.235 33.829 1.00 19.48 N 

ATCM 1199 OA HIS A 370 65.952 19.877 32.638 1.00 25.56 £ 

ATOM 1181 C HIS A 179 65.096 19. 707 31.428 1.00 29.15 £ 

AXCM 1192 Q HIS A 17Q 65.553 19.091 30.47 9 1.00 29.71 O 

AICU 1483 CB HIS A 170 66.783 18.600 32.845 1.00 28.94 £ 

35 Aiai 1484 CG HIS A 170 67.703 18.671 34.034 1.00 33.88 £ 

ATOM 1485 ND1 HIS A 170 68.975 19.203 33.969 1.00 25.46 N 

Aiai 1196 CP2 HIS A 179 67.518 18.298 35.326 1.00 34.77 £ 

Aiai 1197 CE1 HIS A 1?9 69.531 19.151 35.166 1.00 25.63 C 

AXSM 1488 KE2 HIS A 170 68.673 18.603 36.008 1.00 31.72 N 

40 ATOM 1189 N ASF A 171 63.881 20.245 31.440 1.00 21.52 £ 

ATCM 119P CA A5P A 171 63.041 20.267 30.218 1.00 28.63 C 

ATOM 1191 C ASF A 171 63.630 21.459 29.359 1.00 41.94 £ 

ATOM. 1192 9 ASP A 171 64.534 22.171 29.835 1.00 29.69 2 

AI21 1193 CB ASP A 171 61.552 20.558 30.602 1.00 26.40 C 

45 ATQH 1494 CG ASP A 171 60.552 20. 097 29.540 1.00 22.32 £ 
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ATOM 1495 OD1 ASP A 171 60. 890 20 -067 28.325 1.00 32.03 Q 

ATOM 1496 OD2 ASP A 171 59.427 19.719 29.916 1.00 42.13 Q 

ATOM 1497 N ASM A 172 63.141 21.712 28.137 1.00 42.08 JJ 

ATOM 1498 CA ASH A 172 63,616 22,893 27,388 l.QQ 35.95 £ 

5 ATOM 1499 C ASM A 172 62.665 24.056 27.674 1.00 33.71 £ 

ATOM 1500 O ASN A 172 61.586 24. 102 27.104 1,00 32.69 2 

ATOM 1501 CB ASM A 172 63.632 22.667 25.869 l.QQ 41.60 £ 

ATOM 1502 CG ASM A 172 63.B07 23 .987 25.086 l.PQ 39. Q? £ 

ATOM 1503 OD1 ASM A 172 62.973 24.347 24.259 1.00 83.94 Q 

10 ATOM 1504 MD2 ASM A 172 64,855 24.74Q 25.4,18 ItPP. 65.Q7 H 

ATCM 1S05 N PHE A 173 63.Q21 24,95? 29r5B3 l t QQ 31.93 N 

ATOM 1506 CA PHE A 173 62.082 26 .030 2B.944 l,pQ 48.24 £ 

ATCM 1507 C PHE A 173 61.989 27.260 28.045 1.00 69.01 £ 

ATCM 150B Q PHE A 173 62.278 28.395 28.465 1.00 58.79 2 

15 ATCM 1509 CB PHE A 173 62,225 26.4,59 3Q.39Q j.pp 43,43 £ 

ATCM 1510 CG PHE A 173 61.867 25.399 31.356 l.QQ 34.19 £ 

ATCM 1511 CD1 PHE A 173 62rB10 24,498 31.751 1,PQ 24.68 £ 

ATOM 1512 CD 2 PHE A 173 60.621 25.354 31.925 1.00 24.84 £ 

atom 1513 cei phe a 173 62.524 23.518 32.682 l.QQ 23.64 £ 

20 ATCM 1514 CE2 PHE A 173 60.305 24.366 32.804 l.QQ 31,32 £ 

ATCM 1515 CZ PHE A 173 61.263 23.457 33.192 1.00 24.30 £ 

ATCM 1516 M HIS A 174 61.510 27.036 26.831 l.QQ 68,16 K 

ATCM 1517 CA HIS A 174 61.4Q1 28. IP? 25,871 1,PP 64-53 c 

ATCM 1518 C HIS A 174 59.973 28.221 25.4Q0 1 T QP 71,59 £ 

25 atcm 1519 o His h 17^ 59.3Q9 27,196 25.249 i,pp 73,20 o 

ATOM 1520 CB HIS A 174 62.418 27.870 24.736 1.00 71.71 £ 

ATCM 1521 CG HIS A 174 63.835 27.868 25.229 1.00 92.29 £ 

ATCM 1522 MD1 HIS A 174 64.921 27.539 24.440 l.QQIQQ.pO H 

ATCM 1523 CD 2 HIS A 174 64.338 28. 133 26.463 l.OOlQp.pO £ 

30 ATCM 1524 CEI HIS A 174 66.032 27.628 25.160 l.QOlOQ.OO £ 

ATOM 1525 ME2 HIS A 174 65.7Q5 27.981 26.393 1-PP1PP-PP U 

atcm 1526 N PRO A 175 59,469 29,461 25,262 1-PP 65-71 K 

ATCM 1527 CA PRO A 175 5 8.109 29.658 24.770 1.00 55.72 £ 

ATCM 1528 C PRO A 175 59.233 29.297 23.267 1.00 75.83 £ 

35 ATOM 1529 O PRO A 175 57.224 29 .226 22.554 1.00 69.59 Q 

ATOM 1530 CB PRO A 175 57.866 31.142 25-026 1 T 0Q 49-14 £ 

atom 1531 cg pro a 175 59.259 31.79Q 24.901 l.QQ 42.23 £ 

atom 1532 cd pro a 175 69,296 3P.695 25.1Q9 ItQP 49.59 £ 

ATOM 1533 N SER A 176 59.480 28. 954 22.879 1.00 85.09 H 

40 ATCM 1534 CA SER A 176 59.954 28.474 21.548 1.00 81.18 £ 

ATOM 1535 C SER A 176 59.660 2 6.965 21.343 1 T QQ 73,90 C 

ATOM 1536 O SER A 176 59.617 2 6.458 20.213 l t QQ 57-Q3 , O 

atom 1537 cb SER A 176 61.493 28.666 21.447 l.QQ 71.32 £ 

ATCM 1538 QG SER A 176 62.046 29. 349 22.578 1,PQ 51,93 Q 

45 ATOM 1539 M ASN A 177 59.520 26 .276 22.480 l.pO 66.23 N. 



WO 99/64618 



PCT/US99/11576 



116 



ATOM 1540 CA ASN A 177 59.274 24.847 22.619 1.00 56.41 £ 

ATOM 1541 C ASN A 177 57,91Q 24,497 22.353 1,99 §0,91 £ 

ATOM 1542 O ASH A 177 56,914 25r215 2,2, 9U 1,99 55,59 Q 

AT CM 1543 CB ASN A 177 59.619 24,469 24.965 1,99 50,4.5 £ 

5 ATOM 1544 CG ASN A 177 5 9.562 22.970 24.319 1.00 66.57 £ 

ATOM 1545 OD1 ASH A 177 59.095 22.216 23.476 1.00100.00 Q 

ATOM 1546 MD2 ASH A 177 60.099 22.546 25.464 1.00 35.61 U 

ATOM 1547 N SER A 178 57.583 23.387 21.627 1.00 57.10 H 

ATQM 1549 CA SBR A 179 56.234 22.953 2X.279 1,99 59.50 £ 

10 ATOM 1549 C SER A 178 55.557 22.159 22.491 1.00 76.24 £ 

ATOM 1550 O SKR A 178 5 4.575 21.400 22.304 1.00 99.63 Q 

ATOM 1551 CB SER A 178 56.316 21,999 29.11B 1,99 19,17 £ 

ATCM 1552 OG SER A 178 57.397 22.112 19.217 1,99 71.69 Q 

AlfiM 1553 H His a 179 56.134 22.294 23.694 1-QP 37.39 H 

15 AX21 1554 CA HIS A 179 55.569 21.597 24.955 1,99 39.96 £ 

AISH 1555 C HIS A 179 54.961 22.616 25.767 1 T P9 21.93 £ 

AXOi 1556 q HIS A 17? 55.641 23.599 26.139 1,09 25. 1"? Q 

ATOM 1557 CB HIS A 179 56.634 20.683 25.575 1.00 36.20 C 

ATCM 1558 CG HIS A 179 56.973 19.419 24.835 1.00 42.90 £ 

20 ATCM 1559 ND1 HIS A 179 56.973 19.335 23.457 1.00 49.52 N. 

ATCH 1560 CD2 HIS A 179 57.323 18.190 25.278 1.00 52.42 £ 

ATOM 1561 CE1 HIS A 179 57.283 18.109 23.0B4 1.00 44.78 £ 

ATOM 1562 NE2 HIS A 179 57.500 17.393 24.168 1.00 50.49 N 

ATCM 1563 N VAL A 180 53.661 22.454 26 f 038 1 T 00 19-14 N. 

25 AXCtf 1564 ca VAL A 190 52,886 23,449 25,789 1,99 29,03 £ 

ATOM 1565 C VAL A 180 53.373 23.890 28.142 1.00 31.29 £ 

AI£M. 1566 O VAL A 180 53.348 25.075 28.447 1.00 19.55 Q 

ATOM 1567 CB VAL A 180 51.403 23.115 26.914 1.00 35.47 £ 

ATCM 1568 CGI VAL A 180 50.630 24.399 27.217 1.00 35.84 C 

30 ajcu 1569 CG2 val A 19P 5P.923 22.559 25,663 1,QQ 36,11 c 

ATCM 1570 N ILE A 181 53.684 22.935 29.005 1.00 26.57 H 

ATCM 1571 CA ILE A 181 54.138 23.285 30.360 1.00 24.49 C 

AJCU 15J72 £ ile A 181 55,371 24.213 39.361 1,09 16. 51 c 

ATCM 1573 O ILE A 181 55.326 25.315 30 .909 1.00 24.42 Q 

35 ATCM 1574 CB ILE A IB 1 54.285 22.018 31.264 1.00 20.20 £ 

AICH 1575 Cjgl ILE A 191 52.979 21.429 31,529 1,99 19.22 £ 

AXStl 1575 C<?2 ile A 191 55.914 22,315 32.591 1,PP 13,37 £ 

ATCM 1577 CD1 ILE A 181 52.867 20.086 32.286 1.00 8.03 £ 

AlfiM 1578 N PRO A 182 56.452 23.779 29.718 1.00 22.21 

40 ATOM 1579 CA PRO A 182 57.664 24.605 29.640 1.00 22.07 £ 

AXCH 1580 c PRO A 192 57,379 25.952 29,929 1.99 24.19 £ 

AICH 1581 O PRO A 182 57.811 26.949 29.210 1.00 18.35 O 

ATOM 1582 CB PRO A 182 58.682 23.725 28.890 1.00 24.97 £ 

atcm 15B3 cg pro a 182 57.925 22.473 29.471 1.99 25.77 £ 

45 Ajcu 15B4 cn pro a 182 56,727 22.359 29,491 1.99 19,23 £ 
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ATOM 


1595 N ALA a 1B3 


56,628 25.707 27.729 1 nn ?1(4 ^ 






ATOM 


15B6 CA ALA A 1 R3 


56.261 26.896 26.943 1 .00 21.66 


N 




ATOM 


1587 C ALA A 1fl ? 


5^464 27.900 27.811 1 n 0 26.10 


C 




ATOM 


1588 0 ALA A 183 


55_.773 29.091 27.856 1.00 19.50 


C 


5 


ATOM 


1M9 CB ALA A 183 


55.473 26.513 25.703 1 . nn ^jj* 


o 




ATOM 


1590 N LEU A 1R4 


5A.472 27.389 28.543 i . qq 23 _ ?4 


c 




ATOM 


1591 CA LRU A 184, 


53.642 28.215 29.401 i.qq 


N 




ATOM 


1592 C LED A 1R4 


54.312 2B.693 30.655 i.qq ?i |9 1 






ATOM 


1593 O LEU A 184 


54,017 29.771 31.158 I.QQ iq i?1 


c 


10 


ATOM 


1594 CB LEU A 184 


52,309 27.553 29.715 i n op M.41 


o 




ATOM 


_1595 CG LEU A 184 


51 t 342 27. W 7B..525 1.00 23,4? 


c 




AICM 


1596 CD1 LEU A 184 


49.918 27.244 28.97R i.qq nps 


c 




ATOM 


1597 CD 2 LRU A 1»4 


51.390 28,896 27.690 l.oo 21.73 






ATOM 


- 1598 N LEU A 1R5 


55,178 27.879 31.213 1.on 18,3° 


c 


15 


ATOM 


_J599 CA LEU A IBS 


S5 t g33 2.8,33 2 32.417 1.00 16.39 


N 




ATCtf 


_J£0Q C LEU A IRS 


56.669 29,S2ft 31.985 1 - 00 23.67 


c 




ATOM 


1601 O LEU A IBS 


56.681 30.590 32,644 1 . On 29.38 


c 




ATOM 


* 6 02 CB LEU A 185 


56^223 27.233 33.015 l.nnts.os 


o 




ATOM 


1603 CG LEU A 1R5 


56.Q21 26.348 34.041 l.oo ^.56 


c 


20 


ATOM 


1604 CD1 LEU A 185 


5^.819 25.022 34,301 1 . Ofl 21.06 


c 




ATO 


_L605 CD2 LEU A IBS 


55_J22_ 27.113 35.321 1( nn n np 


c 




ATOM 


1606 N ARG A 1R6 


57.337 29.397 30,852 l.oo 17.oq 


c 




ATOM 


_A£0J7_ CA ARG A 1R6 


5^137 30.523 30.429 1 .00 |B.B2 


N 




ATOM 


l 6 ^ 8 C ARG A 186 


51,308 31.752 30.069 1.00 29.00 


c 


25 


ATOM 


_1£0.9 O ARG A 186 


57,629 32.880 30.476 l.oo 73 ^ 


c 




ATOM 


1610 CB ARG A 186 


5^4126 30.146 29.281 1.00 22.06 


o 




ATOM 


_J1611 CG ARG A 186 


59.653 31.365 28.652 1.00 3fl 46 


c 




ATOM 


1612 CD ARG A 186 


6JL825 31.804 29.462 i . nn a 3 .66 


c 




ATCM 


^1613 NE ARG A 186 


_j62.012 31.861 28.631 1.00 7 0 .77 


c 


30 


ATOM 


_J£14_ CZ ARG A 186 


63.059 32.622 28.904 l.oo 91.6b 


N 




ATOM 


1615 NH1 ARG A 186 


63.053 33.386 29.995 i.qq S fi.56 


c 




ATOM 


_J£!6 NH2 ARG A 1B6 


64.098 32.639 28. nap l.ooioo T Qn 


N 




ATOM. 


1617 N ARG A 187 


56.234 31 .544 29.310 i.qq 20 96 


ff 




ATOM 


1519 CA ARG A 1H7 


55.361 32.662 28.941 1.00 19.32 


N 


35 


ATOM 


_JL£19 C ARG A 187 


54.765 33.453 30.142 1.00 28,4! 


C 




ATOM 


_J£2JL__Q ARG A 187 


— 5A.823 34.700 30,193 l.gg 17.23 


C 




ATOM 


1621 CB ARG A 1R7 


54*270 32.223 27.957 1.00 1 7 n5 


0 




ATOM 


-1622 CG ARG A 187 


54.813 31.546 26.720 1.00 61.. 42 


c 






I 6 ? 3 £I> — ARG A 187 


53.696 31.244 25.757 1.00 44.57 


c 


40 


ATOM 


.1524 NE ARG A 187 


53.033 32.472 25.354 1.00 29.47 


, c 




ATOM 


1625 CZ ARG A 187 


5 1. 931 32.534 24.790 1.00 17 -82 






ATOM 


J^26 NH1 ARG A 187 


_LU136 31.427 24.544 1.00 24.95 


... c 




ATOM 


NH2 ARG A 187 


51.341 33.716 24.447 1.00 37.77 


N 




ATOM 


1628 N PHE A 188 


54.192 32.734 31.101 l.oo 2Mft 


N 


45 


ATOM 


1525 CA PHE A 188 


_53.604 33.399 32.259 i.nn 2} ,24 


N 

c 
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ATCM 


1 63 Q C PHE A 188 


54.638 34.080 33,095 


1.00 21.39 


C 




ATCM 


1-631 Q PHE A IBB 


54.394 35.126 33.62S 


1.00 23.90 






ATCM 


1532 CB PHE A 188 


52.723 32.466 33.077 


-UtOO l? t ?5 


9 

c 




ATOM 


1633 CG PHE A 188 


51.389 32.215 32.435 


1.00 22.28 


c 


5 


ATCM 


1£U CD 1 PHE A 188 


50.440 33.229 32.375_ 


1.00 19.42 


c 




ATCM 


1635 CD2 PHE A IBB 


51.144 31,038 31,734 


1.00 23.82 


c 




AI9M 


1636 CE1 PHE A 188 


49.191 33.026 31.747 


1.00 24.77 






ATOM 


1637 CE2 PHE A 188 


.. 49.936 30.826 31.057 


1.00 20.17 


C 




AISM 


_A638 CZ PHE A 188 


48.945 31.815 31.068 


1.00 23.14 


C 

c 


10 


ATCM 


1639 M HIS A 189 


. 55.831 33.513 33.118 


1.00 21.15 






ATCM 


^640 CA HIS A 189 


56.933 34.122 33.837 


1.00 28.79 


K 

c 




AISM 


1641 C HIS A 1B9 


57.303 35.506 33.31* 


1.00 28.58 






ATOM 


1642 O HIS A 189 


57.480 36.463 34.081 


1.00 20.07 


c 




AICM 


1613 CB HIS A 189 


58.148 33.268 33.641 


1.00 31.38 


o 
c 


15 


ATOM 


1644 CG HIS A 189 


59.364 33.844 34.290 


1.00 29.98 


c 




ATOM 


1645 ND1 HIS A 189 


_ 59.548 33.833 35.658 


1.00 31.00 






ATCM 


1646 CD2 HIS A 189 


60.449 34.464 33.76^ 


1.00 21.79 


N 




ATCM 


„16_47 CE1 HIS A 189 


_ 60.722 34.371 35.941 


1.00 24.04 


c 




ATCM 


^£48 ME2 HIS A 189 


61.257 34.815 34.821 


1.00 19.53 


C 


20 


ATCM 


1649 N GLU A 190 


_ 57.539 35.561 32.006 


1.00 28.43 


N 
H 




ATCM 


_J.650 CA GLU A 190 


57.976 36.816 31.32,4 


1.00 27.72 


c 




ATCM 


1651 C GLU A 190 


_ 56.725 37.B29 31.437 


1.00 32.56 






AT«1 


_1£52 O GLU A 190 


56.949 38.995 31.717 


1.00 27.06 


o 




ATCM 


_J1£5J CB GLU A 190 


_.58.122 36.529 29.849 


1.00 28.55 




25 


ATCM 


_JJ54 CG GLU A 190 


52.150 35.461 29.614 


1.00 35.29 


c 




ATCM 


„1655 CD GLU A 190 


60.553 35.941 29.892 


1.00 99. Bl 


c 
c 




ATCM 


1656 OE1 GLU A 190 


60.913 36.037 31. 085 


1.00 86.56 


o 




ATCM 


_JL657 OE2 GLU A 190 


6JU293 36.167 28.910 


1.00100.00 






ATCM 


_1£5_8 N ALA A 191 


55.493 37.391 31.196 


1.00 32.67 


o 


30 


ATCM 


1659 CA ALA A 191 


54.349 38.286 31.311 


1.00 25.30 


ft 




ATCM ._ 


__1660_ C ALA A 191 


.54.287 38.795 32.742 


1.00 36.20 


c 

c 




ATCM 


JL€£1 0 ALA A 191 


53.920 39.924 33.014 


1.00 27.52 






ATCM 


_A662 CB ALA A 191 


.52,055 37.563 31.000 


1.00 16.48 


0 
C 




ATCM 


_1£6_3 N THR A 192 


_.. 54,549 37.927 33.693 


1.00 29.39 




35 


ATCM 


_L664 CA THR A 192 


... 54.395 38.386 35.041 


1.00 19.08 


fl 

c 




ATCM 


1665 C THR A 192 


55.420 39.494 35.298 


1.00 44.78 






ATCM 


1666 O THR A 192 


55.094 40.550 35.839 


1.00 40.58 


c 




ATCM 


_1667 CB THR A 192 


54.515 37.235 35.983 


1.00 18.99 


0 






166? OG1 THR A 192 


53.410 36.348 35.755 


1.00 34.36 


c 

0 


40 


ATCM 


.1669 CG2 THR A 192 


_54.461 37.738 37.425 


1.00 21.15 






ATCM 


N ALA A 19.1 


56.617 39.312 34.757 


1.00 4B.5B 


c 




ATCM 


J^LL CA ALA A 193 


_^5.7.7Q5 40.286 34.905 


1.00 50.59 


N 




ATCM 


JL6.72 C ALA A 193 


__5JL496 41.613 34.145 


1.00 54.42 


c 




ATCM 


.1673 0 ALA A 193 


57.952 42.698 34.553 


1.00 4B.2B 


.. c 


45 


ATCM 


.162.4 CB ALA A 193 


_^3,Q47 39.640 34.496 


1.00 51.78 


9 

c 
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ATCM 1675 W <?LW A 194 56.919 11.539 33.P22 1-QQ *3,*6 K 

AXflM 1676 CA GLM A 194 §€.566 42,722 32.242 1.00 36.03 £ 

ATOM 1677 C G1K A 194 55.264 43.389 32.576 1.00 40.85 £ 

Ami i£7J Q SLN A 194 54,939 11,291 31.945 1,P9 51.2? Q 

5 ATOM 1679 CB GLH A 194 56.599 42 .358 30.750 1.00 35.96 £ 

AlfiM 1680 CG GUI A 194 57.910 41.692 30.290 1.00100.00 £ 

ATOM. 1681 CD GLN A 194 57.715 40.661 29.15B 1.00100.00 £ 

AX£tt 1692 OBI gMf A 134 56.61? 4Q.546 29.57? 1-99109, po o 

ATCM 1693 WE2 gLW A 191 5 8,79 ? 39,904 29.949 1,99199, PP U 

10 ATCM 1694 N ?LY A 195 54.593 12,949 33.63P 1,00 33 T 29 & 

AICH 1685 CA GLY A 195 53.236 43.464 33.864 1.00 36.26 £ 

ATOM 1686 C GLY A 195 52.299 43.332 32.593 1.00 45.33 £ 

ATOM 1687 O gLY A IPS 51.515 44.242 32.346 1.00 45.16 O 

ATOM 1688 N GLY A 196 52.405 42.245 31.788 1.00 36.33 ft 

15 atom. 1689 CA GLY A 196 51.515 41.965 3P.6P9 1,PP 19,96 £ 

Aiai 1690 c gly A 196 59,937 41t959 31.117 l.QQ 22,49 £ 

ATCM 1691 O GLY A 196 49.724 41.479 32.223 1.00 33.09 Q 

ATCM 1692 N PRO A 197 49.144 42.657 30.431 1.00 29.22 H 

ATCM 1693 CA PRO A 197 47.790 42.732 30.953 1.00 25.29 £ 

20 ATOM 1694 C PRO A 197 17,991 41.413 39.674 LOO 24.64 £ 

AI2H 1695 O FRQ A 197 46.192 40.991 31.411 1.00 24.75 Q 

ATCM 1696 CB PRO A 197 47.162 43.911 30.176 1.00 26.31 £ 

ATCM 1 697 CG p ro ft 1 97 49 .199 4 4 ,497 29.252 1,99 26.56 £ 

ATCM 1699 Cg PRO A 197 49.397 43.454 29.293 1,09 39,25 £ 

25 atom 1699 m asp a 19B 47,572 49-723 29,658 1,99 16,99 S 

Aigu 1709 OA ASP A 198 17,967 ?9,119 29.495 1,99 21.65 £ 

ATOM 1701 C ASP A 198 48.046 38.522 28.677 1.00 31.28 £ 

ATCM 1702 O A SP A 198 49.062 38.978 28.172 1.00 34.57 O 

Aicn 1793 CR ASP A 198 15.739 39,597 29,669 1,90 32.99 £ 

30 atcu 1794 cg Asp a 198 15,968 40.055 27,256 ; t oq 16-13 £ 

ATOM 1705 OD1 ASP A 198 46.982 40.230 26.725 1.00 57.45 O 

AXOi 1796 QP2 ASP A 199 41.917 19.271 26.649 1,99 67.61 2 

ATCM 1707 N V AL A 199 47.713 37.234 28.614 1.00 38.67 N 

AICH 1708 CA VAL A 199 48.499 36.226 27.901 1.00 27.79 £ 

35 ATCM 1709 C VAL A 199 47.462 35.469 27.065 1.00 25.88 £ 

A1£M 1719 o VAX A 199 46,469 35,923 27.599 1.99 24.22 2 

ATOM 1711 CB VAL A 199 49,163 35.229 28,90 5 1.09 21-37 £ 

ATCU 1712 CS1 VAL ft 199 49.874 34.047 28.160 1.00 20.28 £ 

AXPM. 1713 CS2 VAL A 199 59.121 35.942 29.935 1,99 22,25 £ 

40 AJCH 1714 N VAL A 200 47.661 3 5.386 25.757 1.00 23.72 N 

&X£H 1715 CA VAL A 200 46.701 34.694 24.903 1.00 23.99 £ 

ATOM 1716 C VAL A 200 47.167 33.286 24.499 1.00 22.85 £ 

ATOM 1717 9 VAL A 209 49.321 33.199 24.199 1-00 29,77 S 

ATOM 1719 CB VAL A 299 46.359 35,549 23,699 1,99 23.11 £ 

45 ATOM 1719 CGI VAL A 200 45.561 34.737 22.59B 1.00 16.25 £ 
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ATOM 1720 CG2 VAL A 200 45.652 36,923 24.139 1.99 27, 86 £ 

ATOM 1721 N VAL A 201 46.296 32.278 24.632 1,99 27,39 H 

ATOM 1722 CA VAL A 201 16.588 39.893 24.265 1.99 9.63 £ 

ATCM 1723 C VAL A 201 45.653 30.529 23.165 1.00 19.63 £ 

5 atom 1724 o VAL A 201 44.452 30.755 23.312 l.QQ 17.61 Q 

ATOM 1725 CB VAL A 201 16.396 2?, 952. 25.42$ 1.99 19.95 £ 

ATOM 1726 CGI VAL A 201 46.703 28.519 25.054 1.00 20.85 £ 

ATOM 1727 CG2 VAL A 201 47.0B6 30 .439 26.661 1.00 16.73 £ 

ATCM 172B N TOP A 202 46.210 30.080 22.030 1.00 14.36 H 

10 ATOM 1729 CA TRP A 202 45.422 29.693 20.865 1.00 18.97 £ 

atom 1730 c top a 202 44,4,95 2,8,572 21,313 1,99 36.22 £ 

ATCM 1731 O TRP A 202 44.934 27.694 22.057 1.00 31.46 Q 

ATOM 1732 CB TRP A 202 46.292 29.055 19.823 1.99 19,11 £ 

ATCM 1733 CG TRP A 202 47.243 29.894 19.066 1.00 33.65 £ 

15 atcm 1734 cm top a 202 48,391 29.463 16.429 1.99 35.29 £ 

atom 1735 CD2 TRP A 202 47.126 31.282 18.772 1.00 39.90 £ 

ATCM 173 6 NE1 TRP A 202 48.941 30.481 17.693 1,99 37,86 fi 

ATCM 1737 CE2 TRP A 202 48.228 31.624 17.922 1,00 38. 35 £ 

atcm 1738 cg3 top a 202 46,296 32,281 19.139 1.99 39,39 £ 

20 atom 1739 cz2 top a 202 49,380 32,984 17.367 1-99 36.15 c 

ATOM 1740 CZ3 TOP A 202 46.356 33.542 18.578 1.00 39.60 £ 

ATOM 1741 CH2 TRP A 202 47.428 33 .828 17.6B4 1.00 40.99 £ 

ATOM 1742 N GLY A 203 43.245 28.564 20.842 1.99 25.59 M 

ATOM 1743 CA GLY A 203 42.332 27.483 21.169 1.00 13.09 £ 

25 atcm 1744 c GLY a 293 41,269 27,813 22.193 1.99 21.12 £ 

ATOM 1745 O GLY A 203 41.340 28.815 22.886 1.00 22.66 O 

Aiaj 1746 n SER h 294 49.279 26.919 22.262 1.90 16.89 N 

ATOM 1747 CA SER A 204 39.163 26.979 23.192 1,99 l«-36 £ 

ATCM 1748 C SER A 204 39.561 26 .664 24.659 1,99 22-07 £ 

30 ATCM 1749 O SER A 204 38.888 27.096 25.604 1.00 34.39 Q 

ATCM 1750 CB SER A 204 38.053 25.998 22.740 1.00 9.99 £ 

ATOM 1751 OS SERA 204 38.237 24.695 23.291 1.00 16,37 Q 

ATOM 1752 N GLY A 205 40 .562 25.813 24.854 1.00 12.42 H 

ATOM 1753 CA GLY A 205 40.963 25.411 26.208 1,99 11,64 £ 

35 ATCM 1754 C GLY A 205 40.20B 24.178 26.711 1.00 19.49 £ 

ATOM 1755 O GLY A 205 40.422 23 .723 27.838 1.99 13,59 2 

ATOM 1756 N THR A 206 39.292 23 .683 25.881 1.00 15.38 H 

atom 1757 ca thr a 206 38,132 22,591 26.291 1,99 19,89 £ 

ATCM 1758 C THR A 206 39.056 21.221 26.154 1.00 26.39 £ 

40 ATCM 1759 O THR A 206 38.564 20.267 26.737 1.00 23.28 Q 

ATOM 1760 CB THR A 206 37.124 22 .562 25.460 1.00 12.86 £ 

ATCM 1761 OG1 THR A 206 37.43B 22.395 24.082 1.00 13.12 Q 

ATOM 1762 CG2 THR A 206 36.348 23.840 25.629 1.00 10.62 £ 

ATOM 1763 N PRO A 207 40.101 21.083 25.354 l.QQ 21.10 U 

45 ATCM 1764 CA PRO A 207 40.658 19.743 25.175 1.09 18.15 £ 
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ATCM 1765 C PRO A 207 41.316 19 -1B1 26.423 1,Q9 21,7$ £ 

ATOM 1766 o PRO A 207 11,951 19,9 2 5 2 7, 2 15 1,00 29.65 2 

ATCM 1767 CB PRO A 207 19.909 24.013 1.00 17,51 £ 

ATOM 1768 CG PRO A 207 41.146 21.213 23.307 1.00 21.45 Q 

5 ATOM 1769 CD PRO A 207 40.698 22.062 24.431 1.00 23.44 £ 

ATOM 1770 M MET A 20B 41 .112 17.876 26.624 1.00 15.60 H 

ATOM 1771 CA MET A 20B 41-694 17.167 27.775 1.00 22.94 £ 

ATOM 1772 C MET A 20B 43.058 16.427 27.579 1.00 21, 9Q £ 

ATOM 1773 O MKT A 208 43.248 15.677 26.633 1.00 23. 16 2 

10 ATOM 1774 CB MET A 208 40.645 16.273 28.386 1.00 32.86 Q 

ATOM 1775 CG MET A 208 39.630 17.057 29.223 1.00 46.17 £ 

ATOM 1776 SD MET A 208 38.301 15 -990 29.826 1.00 57.85 £ 

ATOM 1777 CE MET A 208 37.999 15.028 28.343 1.00 58.23 £ 

ATOM 1778 N ARG A 209 44.022 16. 681 28.456 1.00 17.75 H 

15 ATOM 1779 CA ARG A 209 45.318 16.042 28.324 1.00 19.88 £ 

atom 1780 c arg a 209 45,971 15,534 29,639 1.00 16.92 £ 

ATOM 1781 Q ARG A 209 45.433 15.946 30.697 1,00 16,59 Q 

ATOM 1782 CB ARG A 209 46.340 16.963 27.658 1.00 21. Q7 £ 

ATOM 17B3 CG ARG A 209 45.980 17,479 26.275 1.00 22,57 £ 

20 ATOM 1784 CD ARG A 209 45.833 16.357 25.292 1.00 29,26 £ 

atom 1785 mk arg a 209 45.596 16.919 23.906 1.00 23.15 H 

atom 1786 cz arg a 209 44.420 16.742 23.267 1.00 34.52 £ 

atom 17B7 mhi arg a 209 43,336 16.267 23.690 1.00 18.03 M 

ATOM 178B MH2 ARG A 209 44.339 17.175 22.012 1.00 29.79 H 

25 ATOM 1789 M GLU A 210 46.878 14.675 29.547 1.00 20,87 M 

ATOM 1790 CA GLU A 210 47.530 14.079 30.720 1.00 17.37 C 

ATOM 1791 C GLP A 210 4 9.031 14.490 30.851 1.00 20.96 £ 

ATOM 1792 Q GLU A 210 49.748 14 -622 29.841 1.00 22.44 Q 

ATOM 1793 CB GLU A 210 47.400 12,562 30.571 1,00 16-26 £ 

30 ATOM 1794 CG GLU A 210 47.807 11.785 31.809 1,00 19.91 £ 

ATOM 1795 en GLU A 210 48.057 10.304 31.531 1.00 27.91 £ 

ATOM 1796 OBI GLU A 210 48.111 ? T 9]9 30,343 1-Q0 17. 2^ Q 

ATOM 1797 OE2 GLU A 210 4B.268 9,540 32.494 1.00 21.63 Q 

ATOM 1798 N PHE A 211 49.504 14.712 32.084 1.00 14.02 H 

35 ATOM 1799 CA PHE A 211 50.8B7 15.159 32.353 1.00 17.48 £ 

ATOM 1800 C PHE A 211 51.458 14.414 33.531 1.00 33.62 £ 

ATOM 1801 Q PHE A 211 50.716 14.031 34.443 1,00 27,96 Q 

ATOM 1802 CR PHE A 211 50.933 16.677 32.644 1,00 17.78 £ 

ATOM 1803 CG PHE A 211 50.303 17,490 31,541 1.00 21.49 £ 

40 atom 1804 CD1 PHE A 211 51.009 17.676 30.320 1.00 17.36 £ 

atom 1805 CD2 phe a 213 49,933 17,944 31,619 1-90 15.09 £ 

atom 1806 cki phe a 211 50,399 19.334 29.237 1.00 16,37 £ 

atom 1807 ce2 phe a 211 49.299 1 9.491 30,533 1 ,QQ 9.61 £ 

ATOM 1808 r* PHE A 211 49.053 18.756 29.344 1,00 12,71 C 

45 atom 1809 w leu a 212 52,761 14.161 33.495 1.00 23.76 H 
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ATOM 1810 CA LEU A 212 53.405 13.448 34.603 1.00 21.24 £ 

ATOM 1811 C LEO A 212 51,772 11,053 31,999 l.QQ H.QQ £ 

ATOM 1812 Q LED A 212 55.519 14.398 33.985 1.00 13.99 Q 

ATOM 1813 CB LED A 212 $3,54,8 11,951 31.291 1.00 21.52 £ 

5 AT CM 1814 CG LEO A 212 51.033 11.039 35,4.06 1,00 21.09 £ 

ATOM 1815 CD1 LEU A 212 52.866 10.634 36.280 1.00 20.84 £ 

ATOM 1816 CD2 LEU A 212 54.768 9. B29 34.832 1.00 13.18 £ 

atom 1817 n his a 213 55.023 11.302 36.175 1.00 9. 6Q H 

ATOM 1818 CA HIS A 213 56.290 14.864 36.555 1.00 13.66 £ 

10 ATOM 1819 C HTS A 213 57.380 13.828 36.293 1.00 20.37 £ 

ATOM 1820 O HIS A 213 57.238 12.614 36.542 1,00 16.09 Q 

ATOM 1821 CB HIS A 213 56,280 15.250 39,002 1,00 19,72 £ 

ATOM 1822 CG HIS A 213 57.491 16.017 38.408 1.00 21.22 £ 

ATOM 1823 ND1 HIS A 213 58.703 15.406 38.656 1.00 24.29 N 

15 ATOM 1824 CD2 HTS A 213 57.716 17.353 38.499 1.00 23.67 £ 

ATOM 1825 CE1 HIS A 213 59.615 16.331 38.917 1.00 19.13 £ 

ATOM 1826 NE2 HIS A 213 59.041 17.523 38.847 1,QQ 21.9? N 

ATOM 1827 N VALA 214 58.459 14.295 35.698 1.00 21.07 H 

Alfli 182B ca val a 214 53.532 13.393 35.361 1.00 19,23 £ 

20 ATOM 1829 C VAL A 214 60.067 12 .523 36.551 1.00 27.20 C 

ATOM 1830 O VAL A 214 60.604 11.444 36.359 1.00 22.23 Q 

ATOM 1831 CB VAL A 214 60.625 14.125 34.566 1.00 11.84 C 

ATOM 1832 CGI VAL A 214 61.390 15.199 35.485 1.00 8.52 £ 

ATOM 1833 CG2 VAL A 214 61.560 13. 09? 33.902 1.00 12.39 £ 

25 ATOM 1834 N ASP A 215 59.893 12.984 37.790 1.00 25.29 E 

ATOM 1835 CA ASP A 215 60.406 12.228 38.936 1.00 18.19 Q 

ATOM 1836 C ASP A 215 59.530 11.023 39.230 1,00 13,65 C 

ATOM 1837 O ASP A 215 59.988 9,991 39 T 666 1,QQ 17.11 2 

ATOM 1838 CB ASP A 215 60.575 13.129 40.155 1.00 16.27 £ 

30 ATOM 1839 CG ASP A 215 61.859 13.979 40.068 1.00 30.73 £ 

ATOM 1840 OD1 ASP A 215 62.782 13-611 39.309 1-00 23.02 O 

ATOM 1841 OD2 ASP A 215 61.957 15.029 40.730 1.00 26.00 Q 

ATOM 1842 N ASP A 216 58.276 11.136 38.863 1.00 20.08 H 

ATOM 1843 CA ASP A 216 57.378 10.017 39.016 1-OQ 18,78 C 

35 ATOM 1844 C ASP A 216 57.761 9.093 37.991 1-Q0 33,56 £ 

ATOM 1845 O ASP A 216 57.715 7.990 39.026 1,QQ 20,79 Q 

ATOM 1846 CB ASP A 216 55.912 10. 457 38.821 l.QQ 17, 19 £ 

atom 1847 cg asp a 216 55,193 10.757 10.162 1,QQ 39.0? £ 

ATOM 1848 OD1 ASP A 216 55.503 10.119 41.223 l.QQ 26 t Q2 O 

40 atom 1849 OD2 asp A 216 51.219 11.597 10.121 1.00 25.11 Q 

ATOM 1850 K MET A 217 58.092 9.653 36.755 1.00 18.11 H 

ATOM 1851 CA MET A 217 58.394 8.785 35.636 1.00 22.41 £ 

ATOM 1852 C MET A 217 59.572 7.942 35.992 1.00 27.54 C 

atom 1853 o met A 217 59.579 6,752 35.710 1,00 20.96 2 

45 ATOM 1854 CB MET A 217 58.637 9.592 31.315 1 ■ QQ 21.21 £ 
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ATCM 1855 CG MKT A 217 59.47B 8.318 33.287 1,09 IS, 37 £ 

ATOM 1B56 SP MET A 217 58.962 7.412 32.473 1,00 30,51 S_ 

ATCH 1857 CE MET A 217 57.465 7. €09 32.391 l.OQ 19.57 £ 

ATCM 1858 H ALA A 218 60.561 8.562 35. 623 l.QQ 1?,Q9 U 

5 ATCH 1859 CA ALA A 218 61.774 7,911 37,002 ItOO 13, §5 £ 

ATCM I860 C ALA A 218 61.436 6.778 36.028 1,00 22,61 £ 

ATOM 1B61 O ALA A 218 61.934 5 , 67Q 37,967 1,00 19.36 2 

ATOM 1B62 CB ALA A 21B 62.809 8,780 37,57? ItOO 12,23 £ 

aasaj 1863 W ALA A 219 60.605 7.109 39,000 1.00 19.34 H 

0 ATCM 1B64 CA ALA A 219 60.310 6.105 40. 023 1,QQ 18,01 £ 

Aiai 1865 C ALA A 219 59.630 4,901 39,413 ItQP 23,57 £ 

atcm 1866 o ala a 219 59.781 3.777 39.899 1,00 22.73 £2 

ATCM 1867 CB ALA A 219 59.387 6,678 11,083 1,00 10,11 £ 

ATCM 1868 N ALA A 220 58.753 5.174 38.454 1.00 18.99 H 

5 AEM 1869 CA ALA A 220 57.905 4,159 37.955 1.00 14,1? £ 

ATCM 1870 C ALA A 220 58.753 3.213 37,034 1,00 25.33 £ 

ATCM 1871 9 ALA A 220 58,584 2,006 37.114 l.QQ 20.63 Q 

ATCM 1872 CB ALA A 220 56.796 4.798 37.023 1,00 £ 

ATCM 1873 H SER A 221 59.770 3,772 36,37? 1,QQ 23,92 U 

0 ATCM 1874 CA SER A 221 60.702 3.011 35.556 1.00 18.38 £ 

ATCM 1875 C SER A 221 61.537 1.989 36.353 1.00 20.90 £ 

ATCM 1876 O SER A 221 61.683 0. 799 35.983 1.00 19. B4 Q 

ATCM 1877 CB SER A 221 61.604 3. 985 34.B04 1.00 10.67 £ 

ATCM 1878 OG SER A 221 60.847 4 744 33.967 1,00 15.61 Q 

5 ATCM 1879 N ILE A 222 62.083 2.476 37.463 1.00 18.12 H 

ATCM 1880 CA ILEA 222 62.866 1.644 38,381 1,00 21,56 £ 

AIGM 1881 C ILE A 222 62,020 0.554 39,068 1,00 29,10 c 

ATCM 1B82 O ILE A 222 62.504 -0.566 39.307 1.00 19.03 Q 

ATCM 1883 CB ILE A 222 63.467 2.516 39,432 1 T OP 24,56 £ 

0 ATCM 1BB4 CGI ILE A 222 64.465 3.473 38.765 1.00 32.13 C 

ATCM 1885 CG2 ILE A 222 64.129 1,671 40,500 1,00 28.26 £ 

ATCM 1886 CD1 ILE A 222 64.973 4,585 39.649 1,QP 15,61 £ 

AXCH 1987 N HIg A 223 60.772 P.9Q7 39,394 1,00 19,34 1* 

ATCM 1888 CA HIS A 223 59.829 -0.031 39.99 6 1.00 20.46 C 

5 ATCM 1889 C HIS A 223 59.599 -1.097 38.964 1.00 24.82 £ 

ATCM 1890 O HIS A 223 59.723 -2.283 39.270 1.00 24.66 Q 

ATCM 1891 CB HIS A 223 58.465 0.637 40.359 1.00 19.53 C 

ATCM 1892 CG HIS A 223 57.373 -0.333 40.759 1.00 28.64 £ 

ATCM 1893 HP1 HIS A 223 57,921 -Q.564 12,092 1,00 24,16 H 

0 ATOM 1894 CD2 HIS A 223 56.497 -1.062 40.004 1 , 00 30.39 £ 

ATOM 1B95 CE1 HIS A 223 55.983 -1.399 42.112 1.00 30.39 £ 

ATCH 1896 HE2 HIS A 223 55,652 -1-727 40.869 1-00 28.13 N 

ATCM 1897 N VAL A 224 59.354 -0.684 37.725 1.00 22.06 H 

ATCM 1B98 CA VAL A 224 59.111 -1.657 36,652 1.00 19.15 £ 

5 AXCfi 1899 c VAL a 224 60,350 -2,490 36.333 1.00 25.89 £ 
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AEM 1900 0 VAL A 221 $0,292 -3,709 36.259 l.QQ 22,37 Q 

AXflM 1901 CB VAL A 221 58,559 -1,022 35.377 l.QQ 22,59 £ 

AXOi 1902 C51 VAL A 221 58,512 -2,050 31.231 1.00 22,61 £ 

AT£M 1903 CG2 VAL A 224 57.161 -0.491 35.650 1.00 23.44 £ 

5 AX£M 1904 N MET A 225 61.499 -1.838 36.255 1.00 27.83 H 

ATOM 1905 CA MET A 225 62.710 -2.577 36.004 1.00 23.69 £ 

AJ21 1906 C MET A 225 62.896 -3.678 37.071 1.00 31.95 £ 

AEM 1907 0 MET A 225 63,290 -1.805 36.785 1,00 21,33 2 

AISM 120J6 CB MET A 225 63,902 -1.601 36.056 1,Q0 21.31 £ 

10 AJSM 1909 CG MET A 225 65.295 -2.296 35.999 1.00 17.83 £ 

A3SM 1910 SD MET A 225 65.750 -2.958 34.306 1.00 23.33 S_ 

AEM 1911 CE MET A 225 67.080 -1.896 33.785 1.00 16.46 £ 

AX21 1912 N GLV A 226 62.644 -3.319 38.316 1.00 19.54 H 

ATOM 1913 CA GLU A 226 62.988 -4.161 39.428 1.00 21.58 C 

15 AXCH 1214 £ glV A 226 61.999 -5.200 39.918 1.00 30.77 £ 

AXSH 1215 Q GLU A 226 62.308 -6.012 40.780 1.00 29.39 Q 

A1CU 1916 CB GLU A 226 63.613 -3.323 40.547 1.00 20.47 £ 

ATCM 1917 CG GL7 A 226 61,937 -2,673 10,122 1.00 23-03 £ 

ATOM 1918 CD GLU A 226 65.504 -1.809 41.208 1.00 32.62 £ 

20 fiXOi 1919 OE1 GLU A 226 64.721 -1. 455 42.122 1.00 26.12 £ 

ATCK 1920 <?E2 GLU A 226 66.711 -1.479 41.152 1.00 17.67 Q 

Aiai 1921 W LEV A 227 60.837 -5.248 39.295 1.00 34.11 K 

AISM 1922 ca LEV a 227 59.803 -6,296 39,612 ItQQ 35,26 £ 

AlflM 1923 C L EU A 227 60.537 -7.644 39.320 l t QQ 2 7,91 £ 

25 ATCK 1921 9 LEO A 227 61.291 -7.766 38,310 1,00 19,89 Q 

Aiai 1925 CB LEV A 227 58,693 -6,236 38.678 1,00 36.18 £ 

ATOM 1926 CG LEU A 227 57.381 -5.569 38.955 1.00 40.30 £ 

ATCtt 1927 CD1 LEU A 227 57.697 -4.194 39.382 1.00 42.04 C 

AlfiM 1928 CD2 LEU A 227 56.610 -5.577 37.647 1.00 46.21 C 

30 ATOM 1929 N ALA A 228 60.026 -8.68B 39.955 1.00 27.15 £ 

AISM 193Q CA ALA A 228 60.125 -10,051 39.616 1,90 25.26 £ 

A12M 1931 £ ALA A 228 59.801 -10.135 38,279 1,00 27.93 £ 

ATCM 1932 o ALA A 22« 58,621 -1Q T Q93 37,931 1-QQ 31,26 q 

ATOM 1933 CB ALA A 228 60.003 -11.052 40.703 1.00 22.05 £ 

35 AJSH 1934 N HIS A 229 60.624 -11.160 37.539 1.00 27.05 

A1£H 1935 CA HIS A 229 60.275 -11.605 36.222 1.00 24.42 C 

AXOl 1936 c HI? A 229 58,905 -12,260 36,181 1-00 2lr71 £ 

ATCK 1937 o HIS A 229 58,015 -11,851 35.398 l.QQ 22,22 Q 

AXSH 1939 CB HIS A 229 61. 351 -12,520 35.6?8 l.QQ 17.71 £ 

40 AJCH 1939 CG HIS A 229 61.284 -12.701 34.220 1.00 27.24 £ 

ARK 1940 HD1 HIS A 229 61.060 -11.650 33.350 1.00 34.38 tf 

AlCM 1941 CD2 HIS A 229 61.292 -13.821 33.465 1.00 31.45 £ 

&JCH 1912 CB1 HIS A 229 60,992 -12,113 32.115 1.00 30.50 £ 



AXfitl 1913 HE2 His a 229 61.121 -13,127 32.159 1,00 35,23 n 

45 AXOH 1944 H GLU A 230 58.681 -13.161 37.140 1.00 20.24 N 
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AXfiM 1945 CA gLV A 23Q 57.425 -13.895 37.209 1.00 29.41 £ 

AXSM 1946 C GLU A 230 56.181 -13.051 37.341 1.00 22.20 £ 

ATOM 1947 g gLV A 23Q 55.159 -13.359 36.67 9 1.00 17.78 2 

A3SM 1948 CB GLD A 230 57.464 -14.997 3S. 274 1.00 38.51 £ 

5 A22I 191? W gLV A 230 58.085 -14.582 39.567 1.00 63.09 £ 

AICM 1950 CD gLV A 230 57.036 -14.473 40.661 1.00100.00 £ 

Aiai 1951 QE1 GLV A 230 55.859 -14.872 40.400 1.00100.00 Q 

ATOM 1952 QE2 gLV A 230 57.409 -14.003 41.768 1.00 81.48 Q 

Aiaj 1953 N VAL A 231 56.272 -12.004 38.182 1.00 16.53 H 

10 AT£M 1954 CA VAL A 231 55.202 -11.029 38. 356 1.00 20.23 £ 

AEM 1955 C VAL A 231 55.009 -10.164 37.1 02 1.00 24.45 £ 

AXOJ 1956 O VAL A 231 S3. 864 -9.834 36.705 1.00 21.00 Q 

AXGM 1957 CB VAL A 231 55.541 -10.057 39.426 1.00 28.61 £ 

AEM 1959 £G1 VAL A 231 54.362 -9.098 39.610 1.00 29.78 £ 

15 AlfiM 1959 CG2 VAL A 231 55.881 -10.757 40.677 1-00 28.96 £ 

AI£H I960 H TRP A 232 56.133 -9. 798 36.486 1.00 17.17 N 

Aiai 1961 CA TRP A 232 56.052 -9.044 35.262 1.00 21.52 £ 

AXSM 19_£2 £ TRP A 232 55.388 -9.844 34.156 1.00 20.53 £ 

ATCM 126J Q TRP A 232 54.588 -9. 306 33.380 1.00 24.31 Q 

20 AJ£H 1964 CB TRP A 232 57.438 -8.644 34.801 1.00 29.88 £ 

AXSM 1955 Cg TRP A 232 57.430 -7.843 33.500 1.00 27.65 £ 

AXSM 1966 CP1 TRf A 232 57.184 -6.464 33.356 1.00 25.42 £ 

AX£M 1967 CD2 TRP A 232 57.714 -8.336 32.169 1.00 27.75 £ 

AI£M 1968 WEI TRf A 232 57.325 -6.095 32.033 1.00 22.53 (I 

25 AXSM 1969 CE2 TRP A 232 57.655 -7.203 31.27 9 1.00 25.11 £ 

AEM 197P Cfi3 TRP A 232 58.037 -9. 603 31.640 1.00 22.72 £ 

AIOl 127J CZ2 TRP A 232 57.917 -7.316 29.879 1.00 17.23 £ 

AXSH 1972 CZ3 TRP A 232 58.238 -9.720 30.223 1.00 25.97 £ 

ATOM 1973 CH2 TRP A 232 58.154 -8.581 29.368 1.00 22.07 £ 

30 ATOM 1974 y LEU A 233 55.749 -11.121 3 4.018 1.00 23.80 £ 

AT21 197$ Cft LBV A 23? 55.141 -11.949 32.937 1.00 24.78 £ 

AICU 1976 £ LBV A 233 53.652 -12.118 33.122 1.00 24.51 £ 

AXSH 1977 p LEV A 233 52.865 -12.075 32.163 1.00 28.50 O 

AICM 1978 CB LED A 233 55.765 -13.348 32.820 1.00 26.20 £ 

35 Aiai 1979 CG LBV A 233 57.250 -13.505 32.503 1.00 19.39 £ 

AXSM 1980 CD1 LEU A 233 57.745 -14.850 33.023 1.00 19.90 £ 

AICM 1991 CP2 LBV A 233 57.561 -13.287 31.017 1.00 16.01 £ 

1982 N GLU A 234 53.298 -12.343 34.372 1.00 25.45 H 

AI2M 1993 CA gLV A 231 51.929 -12.523 34.822 1.00 30.04 £ 

40 AX01 1994 C gLV A 234 51.128 -11.319 34.367 1.00 35.69 £ 

AX33 1995 Q gLV A 234 49.926 -11.390 34.052 1.00 28.25 Q 

AICU 198 6 CB gLV A 234 52.007 -12.468 36.344 1.00 37.30 £ 

AlfiM 1997 Cg gLV A 234 50.909 -13,133 37.119 1 T QQ 45,39 £ 

AICH 12S2 CD GLU A 234 51 .112 -12.881 38.601 1.00100.00 £ 

45 ATjJj 1989 QE1 GLU A 234 52.240 -13.137 39.104 1.00 99.09 O 
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AXSM 1999 PE2 CLV A 234 59.211 -12,257 39.211 1,99199.90 Q 

AX£M 1991 W ASM A 235 51.802 -10.1B4 34.364 1.00 25.04 H 

ATPM 1222 CA ASN A 235 51.109 -8.986 33.992 1.00 26.17 £ 

AZSM 1993 C ASH A 235 51.2B0 -8.494 32.571 1.00 30.46 £ 

5 AX£M 1994 O ASM A 235 50.824 -7.393 32.259 1.00 22.90 Q 

AJ£M 1995 CB ASN A 235 51.427 -7.895 34.981 1.00 29.23 £ 

AX£M 1996 CG ASH A 235 50.878 -8.197 36.342 1.00 39.27 £ 

AX£M 1997 QP1 AgN A 235 4.9.722 -7,892 36,629 1.99 29.06 £2 

AEM 1998 KD2 ASN A 235 51.653 -8.934 37.140 1.00 40.22 N. 

10 &IOi 1999 N THR A 236 51.935 -9.268 31.708 1.00 20.97 H 

Aiai 2000 CA THR A 236 52.108 -8.795 30.344 1.00 22.30 £ 

A3SI1 2001 C THR A 236 51.867 -9.943 29.419 1.00 29.74 £ 

ATOM 2002 O THR A 236 51.551 -11.033 29.895 1.00 21.23 Q 

AX2M 299? CB THR A 236 53.545 -B.306 30.161 1.00 22.73 c 

15 Aia* 2004 QG1 THR A 236 54.422 -9.325 30.636 1.00 21.23 O 

A1SH 2005 CG2 THR A 236 53.801 -7.048 31.041 1.00 19.69 £ 

A1CH 2996 N GLN A 237 52.003 -9.699 28.109 1.00 22.23 N_ 

ATCM 2907 CA <?LH A 237 52.097 -10.7B3 27.12 2 1.00 16.69 £ 

ATOM. 2998 C GLN A 237 53.335 -10.507 26.331 1.00 21.02 £ 

20 AICH 2999 Q GLN A 237 53.729 -9. 362 26.204 1.00 22.19 Q 

AI23 2919 CB GLN A 237 50.913 -10.999 26.189 1.00 8.23 £ 

AT£3M 2011 CG GLM A 237 49.639 -11.096 26.904 1.00 21.04 £ 

AXSW 2012 CD GLM A 237 48.907 -9.862 26.606 1.00 62.07 £ 

AXaj 2913 9E1 GLM A 237 49.437 -9,712 25,469 l.oo 59.32 Q 

25 ATOM. 2914 KB2 5LN A 237 49.220 -8.847 27.388 1.00 37.82 N 

ATOM 2915 N PRQ A 239 54.002 -11.579 25.917 1.00 28.76 K 

ATOM. 2916 CA PRQ A 238 55.275 -11.438 25.246 1.00 30.28 £ 

AXOJ 2917 C PRO A 23B 55.194 -10.643 23.958 1.00 29.08 £ 

ATttt 2018 Q PRO A 238 56.181 -10.029 23.600 1.00 15.95 O 

30 AIOS 2919 CP PRO A 238 55.733 -12.879 25.011 1.00 22.54 £ 

ATCM 2020 CG PRO A 238 54.898 -13.710 25.886 1.00 18.92 £ 

AXOI 2021 CD PRO A 238 53.626 -12.998 26.068 1.00 11.75 £ 

ATOM 2922 N MET A 239 54.041 -10.635 23.28 6 1.00 17.26 N. 

AXOJ 2023 CA MET A 239 53.924 -9. 607 22.104 1.00 17.85 £ 

35 A1S2M 2924 C MET A 239 53.109 -8.509 22.362 1.00 18.63 £ 

AXOf 2925 9 MET A 239 52.792 -7.741 21.419 1.00 16.82 Q 

AX£M 2026 CB MET A 239 53.460 -10.588 20.881 1.00 15.22 £ 

AX£M 2927 Cg MET A 239 54.536 -11.534 29.251 1,99 12,90 £ 

AISM 2028 SD MET A 239 53.994 -12.534 1 8.808 1.00 17.49 g_ 

40 ATOM 2029 CE MET A 239 54.350 -11. 357 17.422 1.00 13.12 £ 

AXai 2939 N LEV A 249 52.847 -8. 252 23.646 1.00 18.55 N. 

AJCH 2931 CA LB? A 249 52.159 -7.037 24.131 1.00 16.68 £ 

ATSH 2932 C LEU A 240 52.774 -6.733 25.493 1.00 11.82 £ 

AI2S 2933 Q LEU A 240 52.124 -6.803 26.549 1.00 13.84 Q 

45 Aiai 2934 C& LEV A 249 59,515 -7.249 24.249 1,99 16,91 £ 
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ATCM 2035 CG LBV A 240 49.646 -6. 120 23.B52 1.00 22.29 £ 

AICM 2935 CD1 LB? A 240 4B.968 -5.488 25.033 1.00 25.51 C 

AXCM 2037 CD2 LBU A 240 50.070 -5.059 22.815 1.00 2B.07 £ 

Aiai 2038 N SER A 241 54.076 -6.467 25.4S6 1.00 13.09 H 

5 Aiai 2939 £A SER A 211 54.842 -6.315 26.682 1.00 24.20 £ 

ATCM 2040 C SER A 241 54.947 -4.938 27.377 1.00 30.52 C 

AlfiM 2041 Q SER A 211 55.363 -4.854 28.547 1.00 17.02 2 

ATOM 2042 CB SER A 241 56.247 -6.900 26.495 1.00 14.04 £ 

AICM 2913 qg SBR A 241 57.062 -6,14,4, 25.598 1.Q0 13,95 q 

10 AIQM 2944 W HIS A 242 54.661 -3.861 26.659 1.00 17.87 H 

AICM 2945 CA gig A 242 54.894 -2.548 27.221 1.00 13.55 £ 

AlfiM 2046 C HIS A 242 53.990 -2.254 28.373 1.00 13.70 £ 

AlfiM 2047 O HIS A 242 52.974 -2.885 28.539 1.00 13.29 Q 

AXCa 2949 Cg MS A 242 54.826 -1.430 26.13 0 1.00 16.05 £ 

15 fiiaS 2049 CG HIS A 242 53.595 -1.504 25.272 1.00 18.88 C 

AXCM 2050 HD1 HIS A 242 52.591 -0.553 25.326 1.00 23.24 H 

AICM 2951 CP2 HIS A 242 53.165 -2.461 24,413 l,Qp 13.19 £ 

AItti 2052 CE1 HIS A 242 51.629 -0.887 24.483 1.00 17.44 £ 

AX£M 2953 HE2 HIS A 242 51.962 -2.031 23.90 1 1.00 19.54 H 

20 AlfiM 2054 N I ItB A 243 54.310 -1.203 29.095 1.00 15.84 K 

ATC M 2055 CA ILE A 243 53.492 -0.809 30.192 1.00 19.10 £ 

AX£M 2956 £ ILB A 243 53.336 0.714 30.191 l.OO 23.23 £ 

AISM 2957 9 ILB A 213 54.312 1.1Q6 30.395 1.99 12.19 2 

AlfiM ZQM CB ILB A 243 54.166 -1,273 31.492 1- 99 2 4.6 ? Q 

25 AlfiM 2059 CGI ILE A 243 54.014 -2.783 31.576 1.00 25.60 £ 

ATOM. 2060 CG2 ILE A 243 53.497 -0.665 32.735 1.00 17.37 £ 

ATS*! 2061 epi ILB A 213 54,725 -3-365 32t714 1 T Q9 14.92 £ 

AlfiM 2062 N ASM A 244 1.217 30.013 1.00 16.43 H 

AlfiM 2063 CA ASK A 244 51-824 2.689 30.038 1.00 18.99 £ 

30 AlfiM 2064 C ASH A 244 52.252 3.292 31.348 1.00 18.83 £ 

AlfiM 2065 0 ASN A 244 51.965 2. 727 32.405 1.00 19.58 Q 

AXfiM 2066 CB ASH A 244 50.304 2.9B7 29.910 1.00 15.67 £ 

ATCM 2067 CG ASH A 244 49.768 2.702 28.517 1.00 14.57 £ 

AlfiM 2068 OD1 ASH A 244 50.546 2.583 2 7.580 1.00 13.64 O 

35 AlfiM 2Q69 ffl>2 ASM A 244 48.443 2.491 28.393 1.00 10.16 N 

AlfiM 2070 H VAL A 245 52.800 4.499 31.326 1.00 13.50 H. 

ATCM 2071 CA VAL A 245 53.159 5.134 32.602 1.00 13.49 £ 

AlfiM 2072 C VAL A 245 52.528 6.566 32.644 1.00 16.25 £ 

AlfiM 2973 Q VAL A 245 52.786 7.405 31.770 1.00 15.20 O 

40 AlfiM 2074 eg VAL A 245 54.754 5.163 32.810 1.00 21.07 £ 

AXfiM 2075 CGI VAL A 245 55.154 6.085 33.937 1.00 15.08 £ 

ATfiM 2076 £G2 VAL A 245 55.280 3.817 33.143 1.00 15.82 £ 

Aiai 2077 H GLY A 246 51.696 6.843 33.649 1.00 14.03 H 

AlfiM 2078 CA GLY A 246 51.027 8.136 33.707 1.00 16. B7 C 

45 AlfiM 2979 C GLY A 246 50.146 9.293 34. 93? 1.00 26.95 £ 
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AX££ 2080 O GLY A 246 50.323 7.401 35.850 1.00 23.04 Q 

AIQM. 2Q61 N THR A 247 43.297 9,161 34.963 i,qq 21,44 H 

AXSW 2982 ca, THR a 247 48,232 9,27$ 36.96? 1,09 21,3? £ 

aiSM 2983 C THR A 247 46,969 8,677 35,673 1,99 24.98 £ 

5 AXCM. 2084 O THR A 247 46.069 8.306 36. SOB 1.00 21.03 Q 

ATOM 2085 CB THR A 247 47.98B 10.730 36.404 1.00 22.24 £ 

AXOi 20B6 OG1 THR A 247 47.409 11.389 35.265 1.00 18.62 Q 

AXCM 2PP7 Cg2 THR A 247 43.275 11,376 36.724 1,99 18,99 £ 

ATCM 2999 H SLY A 249 46,59? 9.651 34.384 1.99 24.95 M 

10 AXCM. 2089 CA GLY A 248 45.319 8.143 33.924 1.00 22.61 £ 

Aiai 2999 C GLY A 248 44.223 9.160 34.226 1.00 21.42 £ 

AU3d 2931 Q GLY A 248 43.059 8.866 34.137 1.00 25.70 Q 

aaatj 2092 h val a 249 44.615 10.386 34.521 1.00 30.72 n 

ATCM 209 3 CA VM A 249 43.673 11.464 34.827 1.00 26.09 £ 

15 AEM 2094 C VAL A 249 43.747 12.596 33.7B6 1.00 32.70 £ 

AXCM 2995 o VAL A 249 44.853 13,996 33.387 1,99 26.92 Q 

AXCM 2996 CB VAL A 249 44,929 12,995 36,214 1.99 38,59 £ 

AXCM 2997 CC1 VAL A 249 43,225 13.324 36.47Q 1,99 36.11 £ 

AXCM 2998 CS2 VAL A 249 43,792 11.083 37.306 l.oo 41.30 £ 

20 JMSM 2099 N ASF A 250 42.581 13.125 33.397 1.00 27.95 K 

AXCM 2100 CA ASP A 250 42.488 14.232 32.439 1.00 20.64 £ 

AJ£U 2191 C ASP A 250 42.611 1S.5B1 33.155 1.00 27.63 £ 

AXCM 2192 0 ASP A 259 42.188 15.783 34.398 1,99 26.23 Q 

ATOM 2103 CB ASP A 250 41.075 14.302 31.827 1.00 23.89 £ 

25 ATOM 2104 CG ASP A 250 40.768 13.180 30.850 1-00 39.52 £ 

ATOM 2105 OD1 ASP A 250 41.283 13.184 29.688 1.00 39.96 Q 

AXCM 2106 OD2 ASP A 250 39.767 12.501 31.153 1.00 45.34 Q 

AXCM 2107 M CYS A 251 43.029 16.566 32.388 1.00 20.12 H 

ATOM 210 8 CA CYS A 251 42.962 17.906 32.851 1.00 27.20 £ 

30 AI£M 2109 C CYS A 251 42.918 18.779 31.577 1.00 26.47 C 

AXCM 2119 o CYS A 251 43.699 18.569 39,6?3 i r po i?.45 q 

AXCM 2111 CB CYS A 251 44.148 18.157 33.778 1.00 34.86 C 

ATCM 2112 S? CYS A 251 45.129 19.619 33.453 1-99 29,47 £ 

ATgM 2113 H THR A 252 41.932 19.673 31.494 1.00 14.85 H 

35 AX£H 2114 CA THR A 252 41.834 20.588 30.335 1.00 21.21 £ 

AXCM 2115 C THR A 252 42.999 21.592 30.236 1.00 20.53 £ 

AXCM 2116 Q THR A 252 43.657 21,926 31,249 1.99 15,24 Q 

Aiai 2117 Cg THR A 252 49.596 21.407 39.329 1,99 32.99 £ 

AXCM 2118 TO THR A 252 49,469 22.394 31.447 1-99 19,26 Q 

40 AX£H 2119 CG2 THR A 252 39.309 2 0.495 30.372 1.00 13.91 C 

AXSM 2120 N ll£ A 253 43.228 22.095 29.024 1.00 14.81 K 

AXCM 2121 CA ILEA 253 44.264 23. 118 28.812 1.00 16.90 £ 

Aiai 2122 C I LB A 253 43.934 24,393 29,627 l.po 23.41 £ 

AXCM 2123 P ILS A 253 44.834 25.912 39.247 1 T QQ 15.27 Q 

45 ATCM 2124 CB TLB A 253 44.404 23.452 27.302 1.00 24.05 C 
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atom 2125 cgi ile a 253 44.862 22,209 26,561 1.00 27.33 £ 

ATOM 2126 CG2 ILE A 253 45.473 24.479 27,077 1,00 9.22 £ 

ATCM 2127 CD1 ILE A 253 45.662 21,276 27,152 1,00 49,56 £ 

ATCM 2128 W ARG A 254 42,637 21.709 2?, 707 1.00 19.56 N 

5 ATCM 2129 CA ARG A 254 42.228 25.865 30.522 1.00 19.41 £ 

ATCM 2130 C ARG A 254 42.712 25.713 31.970 1.00 18.10 C 

atcm 2131 o arg a 254 43.311 26.616 32.515 1,00 13,89 Q 

ATCM 2132 CB ARG A 254 40.704 26,101 30.190 1.00 15.99 £ 

atom. 2133 Cg Are A 254 40.282 27.379 31,255 1,00 9.96 £ 

10 ATCM 2134 CD ARG A 254 38.B09 27.702 31.218 1.00 24.79 £ 

atcm 2135 ne arg a 254 38.198 28.414 29,997 1,00 29,42 H 

ATCM 2136 CZ ARG A 254 38.693 29. 723 29.794 1.00 59.85 £ 

ATCM 2137 NH1 ARG A 254 39.194 30 .527 30.732 l.QQ 42.58 tt 

ATCM 2138 NH2 ARG A 254 38.377 30.245 28.620 1.00 18.44 H 

15 ATCM 2139 M ASP A 255 42.406 24.564 32.586 1.00 20.22 H 

atom 214Q CA ASP ft 255 42.795 24,205 33.971 1,00 16.49 £ 

ATOM 2141 C ASP A 255 44.321 24 .372 34.069 1.00 22.43 £ 

ATCM 2142 Q ASP A 255 44.868 24.897 35.060 1.00 18.53 Q 

atcm 2143 cb asp a 255 42,478 22.686 31.157 1,00 19.17 £ 

20 ATCM 2144 CG ASP A 255 42.144 22 .246 35.610 1.00 47.08 £ 

ATCM 2145 QD1 ASP A 255 41.780 23.090 36.429 1.00 49.66 Q 

atcm 2146 OD2 asp a 255 12,020 21,016 35.880 1,00 19.12 2 

atcm 2147 m UMA 256 45.014 23.809 33,078 l.QQ 15.98 H 

ATOM 2148 CA LEU A 256 4,6,465 23.844 33.069 IrOQ 21-76 £ 

25 atcm 2149 c leu A 256 47.020 25.275 33.076 1,00 16.79 £ 

atcm 2150 o LEV A 256 47.825 25.697 33.946 1.00 15.24 2 

ATOM 2151 cb LRU A 256 46,967 23,056 31r95? 1,00 23,33 £ 

atom 2152 cg leu a 256 48,491 23.100 31,765 1.00 26.80 £ 

ATOM 2153 CD1 LEU A 256 49.171 22.334 32.984 1.00 17.13 C 

30 atcm 2154 cd 2 leu a 256 49,Q4Q 22,724 30.346 1-00 15,42 £ 

ATOM 2155 N ALA A 257 46.520 26.048 32.140 1.00 13.77 H 

ATCM 2156 CA ALA A 257 46.938 27.436 32.025 l.QQ 12,70 £ 

atom 2157 c ala a 257 46.656 28,237 33.267 1.00 10,73 £ 

ATCM 215B O ALA A 257 47.451 29.073 33.672 1.00 20.33 Q 

35 ATOM 2159 CB ALA A 257 46.208 28.073 30.B34 1.00 13.34 £ 

ATCM 2160 N GLK A 258 45.470 28.080 33.835 1.00 12.40 H 

ATCM 2161 CA GLN A 258 45.102 28,911 34,991 1,00 8.39 £ 

ATCM 2162 C GLK A 258 45.879 28.480 36.166 1.00 13.46 £ 

ATOM 2163 O GLJf A 258 46.178 29.281 37.029 1.00 22.96 Q 

40 ATCM 2164 CB GLN A 258 43.614 28 .761 35.305 1,00 16.12 £ 

atcm 2165 cg gln A 258 42.674 29,096 34.130 1,00 30.19 £ 

atcm 2166 cd gln a 258 42.574 30.585 33.781 1.00 37.29 £ 

atcm 2167 OKI gln a 258 42.911 31.471 31.610 1.00 21.24 2 

atcm 2168 NE2 gln a 258 42.021 30,976 32.572 1,00 15,94 H 

45 ATCM 2169 W THR A 259 46.179 27.182 36.232 1.00 16.21 H 



WO 99/64618 



PCT/US99/11576 



130 



AISM 2170 £ft THR A 259 46.982 26.678 37.336 1.00 16.85 £ 

AISH 2171 C THR A 259 48.410 27.186 37.233 1.00 20.56 £ 

M£M 2122 2 THR A 25? 49.002 27.621 38.214 1.00 21.44 Q 

2173 CB THR A 259 47.066 25.192 37.361 1.00 27.56 £ 

5 AXOi 2174 OGl THR A 259 45.752 24.620 37.509 1.00 20.92 Q 

ATJ24 2175 qqz TBR A 25? 17,936, 24.796 38.545 1,00 12.95 C_ 

AXfiM 2176 N ILE A 260 18.952 27.170 36.029 1.00 19.96 H 

ATCM 2177 CA ILE A 260 50.292 27.704 35.B39 1.00 23.01 £ 

Aiai 2179 C ILE A 260 50,313 29.180 36.225 1.09 31,7? c 

10 AEM 2179 Q ILK A 260 51.211 29.627 36.993 1.00 25.90 2 

AXOl 218Q CB ILE A 260 50.635 27.456 34.390 1.00 22.46 £ 

ATCf* 2181 CGI ILE A 260 51.153 25.940 34.232 1.00 24.12 C 

AXSM 2182 CC2 ILE A 260 52.099 28.361 34.106 1,00 13,47 c 

AXSM 2183 CPl ILE A 260 51.501 25.143 32.810 1,00 12,58 £ 

15 AlfiM 2181 N ALA A 261 49.280 29.910 35.764 1.00 15.35 H 

ATOM 2185 CA ALA A 261 19.177 31,355 ?§ t Q19 l.OO 16.00 £ 

AEM 2186 C ALA A 261 49.316 31.604 37.550 1.00 20. SB £ 

A35M 2187 Q ALA A 261 50.104 32.443 37.987 1.Q0 16.09 Q 

ATOM 2188 CB ALA A 261 47.832 31.958 35.487 1.00 13.65 £ 

20 fiiaj 2189 N LIS A 262 48.551 30.B43 3B.323 1.00 11.50 ]J 

Ami 2190 CA LYS A 262 49,578 30.905 39,770 1-00 10,13 £ 

ATOM 2191 C LYS A 262 49.968 30.460 40.296 1.00 28. OB £ 

Aiai 2192 9 LYS A 262 50.503 31.084 41.205 l.oo 29.37 Q 

AieM 2193 CB LYS A 262 47.453 30.032 40.335 1.00 12.50 £ 

25 ATCM 2194 CG LYS A 262 47.332 29.962 41. 8 8 9 1-00 16.51 £ 

Aiai 2195 CP LYS A 26? 46.092 29.092 42.371 1.00 46.61 C 

AT21 2196 CE LYS A 262 46.344 27.555 42.661 1.00 99.70 C 

AT9M 2197 NZ LYS A 262 45.157 26.703 43.200 1.00 36.59 U 

AXJ2I 2198 N VAL A 263 50,589 29.413 39.705 1.00 17.14 H 

30 AIGM 2199 CA VAL A 263 51,915 29,039 40,171 l t po 19 T 72 £ 

&X£U 2200 C VAL A ?63 $2.997 3Q,17Q ?9t997 1,00 32,12 £ 

AXO! 22P1 Q VAL A 263 53.871 30.412 40.834 1.00 21.18 Q 

AXSM 2202 CB VAL A 263 52.389 27.709 39.476 1.00 16.35 £ 

AlflM 2203 CGI VAL A 263 53.920 27.518 39.647 1.00 11. B3 £ 

35 AT£M 22PJ CG2 VAL A 263 51.646 26.522 40.093 1.00 14.99 C 

AXfiM 2205 W VAL A 264 52.913 30.899 38.909 1.00 21.75 £ 

A121 2206 CA VAL A 264 53.917 31,877 38.653 1,00 19.81 £ 

ATCM 22Q7 C VAL A 264 53.719 33.208 39.377 1.00 35.79 C 

AlfltJ ZZM Q VAL A 264 54.632 34.032 39.482 1.00 28.99 O 

40 ATCM 2209 CP VAL A 264 54.059 32.014 37.175 1.00 24.27 £ 

AXOJ 2210 CGI VAL A 264 54.728 33.269 36.822 1.00 33.58 £ 

ATCM 2211 CG2 VAL A 264 54.840 30.808 36.674 1.00 23.01 £ 

AXCH 2212 » GLY A 265 52.550 33.376 39.969 1.00 25.30 U 

A1£H 221? CA SLY A 265 52.241 34.620 40.636 1.00 24.14 C 

45 ATCM 2214 C SLY A 265 51.730 35. 694 39.632 1.00 35.03 £ 
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ATOM 2215 O GLY A 265 51.773 36.911 39.962 1.00 33.71 Q 

ATCM 2216 N TYR A 266 51.294 35.257 38.426 1.00 26.25 M 

ATOM 2217 CA TYR A 266 50.698 36.151 37.373 1.00 26.55 £ 

Aiai 2218 c tyr a 266 49,364 36,715 37,819 1.90 31,91 £ 

5 ATOM 2219 O TYR A 266 4 B.532 36.067 38.456 1.00 27.99 O 

ATOM 2220 CB TYR A 266 50.501 35.463 36.008 1-00 34.31 C 

ATCM 2221 CG TYR A 266 49.994 36.381 34.864 l,Qp 28.64 £ 

ATCM 2222 CD1 TYR A 2 66 50.670 37.582 34.542 1.00 35.05 C 

ATCM 2223 CD2 TYR A 266 48.860 36,038 34.118 1,99 22,60 £ 

10 ATCM 2224 CB1 TYR A 266 59.212 38.434 33,472 1,99 29,73 £ 

ATCM 2225 CB2 TYR A 266 48.428 36 .859 33.012 1.00 20.91 £ 

ATCM 2226 CZ TYR A 266 49.088 39,9 62 3 2,735 1.00 23.85 C 

ATCM 2227 OH TYR A 266 48.622 38.851 31.710 1.00 33.40 Q 

ATCM 2228 N LYS A 267 49.217 38.043 37.604 1,99 25.72 E 

15 ATCM 2229 CA LYS A 267 47.988 38.697 38.009 1.00 30.77 C 

ATCM 2230 C LYS A 267 47.217 39.280 36.798 1,99 28.85 £ 

ATCM 2231 O LYS A 267 46.179 39.894 36.949 1.00 31.17 2 

AX£M 2232 cb lys a 267 48.279 3?, 741 39,992 1,99 27.13 £ 

ATCM 2233 CG LYS A 267 48.728 39.128 49.493 1,99 23.18 £ 

20 ATCM 2234 CD LYS A 267 49.420 40.096 41.562 1.00 30.98 £ 

ATCM 2235 CE LYS A 267 47.933 39 .358 42.820 1.00 48.52 £ 

ATCM 2236 WZ LYS A 267 47.005 38.208 42.505 1.00100.00 U 

ATCM 2237 N GLY A 268 47.716 39.054 35.594 1,99 22.67 H 

atcm 2238 ca gly a 268 47.919 39.519 34.394 1,09 21,38 £ 

25 ATCM 2239 C GLY A 268 45.B56 38.56B 34.085 1,99 31-93 £ 

ATCM 2240 O GLY A 268 45.455 37.728 34.911 1,99 19,?? £ 

ATCM 2241 N ARG A 2 69 45.387 38.645 32.849 1.00 30.40 U 

atom 2242 ca arg a 2 69 44.263 37.846 32.399 1,90 26.47 £ 

ATCM 2243 C ARG A 269 44.680 36.705 31.489 1.00 22.35 C 

30 ATCM 2244 O ARG A 269 45.378 36.926 30.524 1.00 22.75 O 

ATOM 2245 CB ARG A 269 43,297 38,753 31.626 1.00 22.65 £ 

ATCM 2246 CG ARG A 269 42.201 39.390 32.463 1.00 24.21 £ 

ATCM 2247 CD ARG A 269 40.936 39.465 31.568 1,99 33t*5 £ 

ATOM 224B MB ARG A 269 40.113 40.676 31.762 1.9919Q.99 H 

35 ATCM 2249 CZ ARG A 269 38.808 40.751 31.431 1.00100.00 £ 

ATCM 2250 NH1 ARG A 269 38.201 39 .691 30.921 1.00 99. ?3 H 

ATOM 2251 NH2 ARG A 269 38.094 41.865 31.663 1,99199,09 t* 

ATCM 2252 N VAL A 270 41.195 35.494 31.758 1,99 19,87 X 

atom 2253 ca val A 270 44,469 34.389 39.856 1.99 21.92 £ 

40 ATOM 2254 C VAL A 270 43.319 34.456 29.824 1.00 22,51 £ 

ATOM 2255 O VAL A 270 42.145 34.501 30.181 l t 99 25.79 Q 

ATOM 2256 CB VAL A 270 44.436 32.979 31.571 1.00 24.03 £ 

ATCM 2257 CGI VAL A 270 44.576 31.861 39.533 lt99 29,72 £ 

atom 2258 cg2 val a 270 45-596 32,849 32,639 1.99 11-27 £ 

45 atom 2259 n val A 271 43.669 34,499 28,554 — 1-99 25,18 u 
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AXSM 2260 CA VftL ft 271 42.666 34,492 27,4B7 1 T QQ 29,3? c 

ilSH 22^1 c VftL A 271 42.91? 33.379 26,442 1,00 24.9? £ 

ATOM 2262 O VAL A 271 43.923 33.115 25.980 1.00 21-98 Q 

ftiai 226? CB VftL ft 271 42.301 35,813 26,736, i,po 29.25 £ 

5 ATOM 2264 CGI VAL A 271 42.256 35.7 73 25.370 1.00 31.91 £ 

AXOI 2265 CG2 VAL A 271 42.421 36.989 27.565 1.00 18.72 £ 

ATOM 2266 N PHE A 272 41.716 32.758 26.019 1.00 26.14 H 

ATOM 2267 CA PHE A 272 41.752 31.747 24.963 1.00 24.34 £ 

AISH 2269 c PHE A 272 41.236 32,266 23.623 1.90 29.95 £ 

10 AISM 2269 Q PHE A 272 40.155 32.926 23.592 l.QQ 22.01 Q 

ATOM 2270 CB PHE A 272 40.960 30.506 25.391 1.00 20.97 C 

ATOM 2271 CG PHE A 272 41.764 29.570 26.243 1.00 21.77 £ 

ATOM 2272 CD1 PHE A 272 41.940 29.842 27.610 l .QQ 1 4 ,60 £ 

ATOM 2273 CD2 PHE A 272 42.504 2B.550 25.656 1.00 22.19 £ 

15 ATOM 2274 CE1 PHE A 272 42.763 29.041 28.434 1.00 17.89 £ 

AXQH 2275 CE2 PHE A 272 43.336 27,726 26,454 1,00 27r64 £ 

AEH 2276 CZ PHE A 272 43.478 27.979 27.851 1,QQ 25,14 £ 

ATOM 2277 N ASP A 273 42.012 32.114 22.542 1.00 29.45 N 

ATOM 2278 CA ASP A 273 41.557 32.536 21.214 1.00 22.33 £ 

20 ATOM 2279 C ASP A 273 40.896 31. 365 20.493 1.00 25.67 £ 

ATOM 2280 O ASP A 273 41.539 30.570 19.793 1.00 17.81 Q 

AEM 22B1 CB ASP A 273 42.672 33.114 20.343 1.00 21.45 £ 

ATOM 22B2 CG ASP A 273 42.131 33.626 18.990 1.00 26.89 £ 

ATOM 2293 OP1 APP A 273 40.975 33 .249 19,599 1 ,0 0 2 7 ,76 Q 

25 ATOM 228 4 OD2 ASP A 273 42.838 34.421 18.327 1.00 30.06 O 

ATOM 2285 N ALA A 274 39.589 31,294 20,64? 1.0° 15.59 M 

ATOM 2286 CA ALA A 274 38.932 30.12B 20.12B 1.00 23.75 £ 

ATOM 2287 C ALA A 274 38.853 30.168 18.653 1.00 32.30 C 

AT9M 2288 O ALA A 274 38.284 29.256 18.029 1.00 29.37 2 

30 ATOM 2289 CB ALA A 274 37.567 29.905 20.777 1.00 18.87 £ 

ATOM 2290 N SEP, A 275 39.372 31.243 19,091 l.QQ 21,10 N 

ATOM 2291 CA SER A 275 39.343 31.288 16.631 1.00 26.90 £ 

ATOM 2292 C gfift A 275 40.390 30.300 16.116 1,00 43,3? £ 

AISW 2293 O SER A 275 40.421 2 9.949 14.927 1.00 46.32 O 

35 ATOM 2294 CB SER A 275 39.547 32.683 16 .074 1.00 15.19 £ 

ATOM 2295 OG SER A 275 40.904 33.070 16.078 1.00 28.71 O 

AX£M 2296 N LY? A 276 41,192 29,790 17,037 j.OO 2?.9B ft 

Ami 2297 CA LYS A 276 42.178 29.791 16.639 1,00 23.29 c 

ATOM 2299 c LYS ft 276 41.645 27-405 16,976 l,QQ 29,73 £ 

40 ATOM 2299 O LYS A 27 6 40.992 27.206 18.010 1.00 25.10 Q 

A1CH 2300 CB LYS A 276 43.544 29.051 17.275 1.00 19.19 £ 

AIOH 2301 CG LYS A 276 43.957 30.496 17.218 1-00 32.11 £ 

ATOM 2302 CD LYS A 276 44.062 30.852 15.79B 1,00 2 2,43 C 

ATOM 2303 CS LYS A 276 44.930 32.067 15.570 1.00 23.18 £ 

45 Axon 2304 kz lys a 276 45.454 32,117 14.15? i t qq 29,42 K 
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AIPM 2305 N PRO A 277 41.892 26.476 16.055 1. 00 36.04 

AXOi 2306 £A PRO ft 277 41.446 25.087 16.170 1.00 35.93 £ 

ATOM 2397 C PRO A 277 12. 92 2 2 1 .332 17.363 1.00 29.30 c 

AXSK 2398 P PRP A 277 43.103 24.650 17.885 1.00 30.54 Q 

5 Aiai 2309 CB PRO A 277 41.975 24.453 14.87B 1.00 39.65 £ 

ATCtt 2310 CG PRO A 277 43.249 25.261 14.566 1.00 42.90 £ 

ATOM 2311 CD PRO A 277 42.787 26.670 14.892 1.00 37.84 £ 

AXOl 2312 N ASP A 279 41.273 23.339 17.B09 1.00 22.35 

AXSH 2313 CA ASP A 279 11,745 22.501 19.993 l.QQ 22. U £ 

10 AX£M 2311 C ASP A 279 42.184 21.189 18.272 1.00 19.66 £ 

AX2M 2315 O ASP A 278 41.905 20.917 17.117 1.00 23.49 Q 

ATOM 2316 £B ASP A 278 40.636 22.241 19.971 1.00 15.09 £ 

AX2M 2317 CG ASP A 278 40.216 23.503 20.702 1.00 22.86 £ 

AT2M 2319 QD1 ASP A 278 41.113 24.254 21.096 1.00 25.18 Q 

15 Aiaj 2319 OP2 AffP ft 27B 38.999 23.7B7 20.812 1.00 39.55 O 

AXO* 232Q N GLY A 279 42.846 20.355 19.044 1.00 30.65 £ 

AIOE 2321 CA GLY A 279 43.229 19.034 18.546 1.00 33.78 £ 

AlfiM £ <HiY A 279 42.115 IB . 099 18.944 1.00 38.10 £ 

Aiai 2323 Q GLY A 279 40.963 18.517 19.068 1.00 47.52 O 

20 AXOl 2321 N THR A 289 42.419 16.839 19.177 1.00 29.44 |I 

AXSM 2325 £A THR A 289 41.328 15.990 19.587 1.00 26.68 £ 

AXSM 2326 £ THR A 299 40.889 16.439 20.972 1.00 23.52 £ 

AX21 2327 P THR A 289 41.670 17.067 21.713 l.OO 23.62 p 

ATPM 2329 CB THR A 2B0 41.695 14.492 19.540 1.00 40.78 C 

25 AlfiM 232? QGl THR A 2B0 42.889 14.272 20.296 1.00 25.56 2 

AISM 2330 CG2 THR A 280 41.893 14.054 IB. 095 1.00 37.71 £ 

AIQM 2331 N PRO A 291 39.672 16.063 21.346 1.00 25.54 J* 

Aiai U22 CA PRP A 281 39.129 16.454 22.628 1.00 25.72 £ 

ATOM 2333 C PRP A 281 39.776 15.778 23.800 l.QO 26.02 £ 

30 Aiai 2331 P PRP A 281 39.752 16.314 24.915 1.00 22.68 O 

AieSJ 2335 CB PRO A 281 37.650 15.990 22.559 1.00 28.89 £ 

ATSM 2336, £G PRp A 281 37.417 15.540 21.201 1.00 29.39 £ 

ATPM 2337 CP PRP A 281 38.761 15.138 20.64 6 1.00 26.82 £ 

AX21 2338 W ARG A 292 40.281 14.567 23.587 1.00 27.88 H 

35 ATOM 2339 CA ARG A 282 40.806 13.817 24.720 1.00 34.08 £ 

ATOM 2319 C ARG A 282 41.977 12.918 24.384 1.00 27.62 £ 

Aiai 2311 P ARP A 282 41.913 12.182 23.425 1.00 23.83 Q 

ATPM Z1A2 CB ARG A 282 39.676 13.017 25.405 l.OO 20.89 £ 

AICJ3 2343 CG ARg A 282 40.035 12.467 26.775 1.00 22.81 £ 

40 Aiai UA1 CD ARG A 282 38.762 11.925 27.442 1.00 26.77 £ 

A1SM 2315 ME ARg A 282 38.963 11.345 28.781 1.00 36.48 

ATPM 2316 C? ARG A 292 3B.518 10.139 29.164 1.00 37.74 £ 

Aiai 2347 MH1 ARG A 262 37.813 9.360 28.346 1.00 28.45 H 

A TOM 2348 NH2 ARG A 282 38.754 9.700 30.384 1.00 27.25 N 

45 AlfiM 2319 N LYS A 283 43.016 12.963 2 5.223 1.00 2B.91 jj 
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ATOM 2350 CA LYS A 283 44.217 12.171 25.051 1.00 24.32 £ 

ATOM 2351 C LYS A 283 44.796 11.766 26.404 1.00 29.57 £ 

ATOM 2352 Q LYS A 283 45.262 12.626 27.136 j.00 33,16 2 

&XCH 235? cb LYS a 293 45.226 13.PP9 21.297 1.99 21.93 £ 

5 ATOM 2354 CG LYS A 283 46.111 12.251 23.316 1.00 32.38 £ 

ATOM 2355 CD LYS A 283 46.526 13.171 22.143 1.00 95.77 £ 

ATOM 2356 CE LYS A 283 45.710 12. 937 20.836 1 ,9 9199.9 9 £ 

ATOM 2357 NZ LYS A 283 46.418 13.332 19,53$ 1.00100.00 H 

ATCM 2358 N LBV ft 294 44.747 19,467 2$, 734 1,99 23,37 M 

10 ATCM 2359 CA LEU A 284 45.327 9.905 27.997 1.00 16. OB £ 

AJ£M 2360 C LEU A 284 45.463 9,396 29.P47 1,99 29,46. £ 

ATCM 2361 O LEU A 284 44.679 7.655 27,446 1.Q9 25.45 Q 

ATCM 2362 CB LEU A 284 44.641 10.387 29.284 1.00 16.30 £ 

ATCM 2363 CG LEU A 284 43.334 9.700 29.714 1.00 25.97 £ 

15 ATCM 2364 CD1 LEU A 284 42.881 10.069 31.152 l,0p 2 2- 11 £ 

ATCM 2365 CD2 LEU A 284 42.203 9.953 28.693 1,99 23.92 £ 

ATCM 2366 N LEU A 285 46.453 7.939 29,920 1,P9 19, 51 M 

A2£M. 2357 CA LEV A 295 46.732 6.527 29.993 1,9P 16.77 £ 

ATOM 2368 C LEU A 285 45, 999 5-965 39,996 1.00 30.75 £ 

20 ATCM 2369 O LEU A 285 45,576 6.439 31.058 1.00 22.02 Q 

ATCM 2370 CB LEU A 285 48.229 6.389 29.585 1.00 15.85 C 

AlCM 2371 CG LEU A 285 49.307 6,979 29.672 1.99 21.51 £ 

ATCM 2372 CDl LEU A 285 50.703 6.705 29.122 1,90 15,1 5 £ 

ATCM 2373 CD2 LEU A 285 49.051 6.368 27,330 1,0Q 16 T 94 £ 

25 atcm 2374 n asp A 286 45.565 4.59? 29,731 1,P9 26.62 M 

ATCM 2375 CA ASP A 286 44.945 3 f 726 ?0 T 698 l r p9 10,99 £ 

ATCM 2376 C ASP A 286 46.128 3.055 31.498 1.00 20.54 £ 

ATCM 2377 O ASP A 286 46.991 2.372 30.938 1.00 23.38 O 

ATCM 2378 CB ASP A 286 44.073 2.702 29.970 1.00 14.65 £ 

30 ATCM 2379 CG ASP A 286 43.409 1.699 30.943 1.00 24.60 C 

ATCM 23B0 OD1 ASP A 286 43.932 1,437 32,993 1-PO 24,60 O 

ATCM 2381 OD2 ASP A 286 42,316 1.231 39.593 i t Q9 26,93 Q 

AlfiM 2392 W VAX, a 287 46 T 239 3,3j7 32,791 4.99 15,44 H 

ATCM 2383 CA VAL A 287 47.354 2. 816 33.556 1.00 15.58 £ 

35 ATCM 2384 C VAL A 287 46.973 1.695 34.521 1.00 16.48 £ 

ATCM 2385 O VAL A 287 47.613 1.473 35.572 1.00 16.63 Q 

ATCM 2386 CB VAL A 287 48.101 4,996 34,269 1,00 29-64 £ 

ATOM 2387 CGI VAL A 287 48.534 5.085 33.224 1.99 19.39 £ 

AlfiM 2388 CG2 VAL A 287 17.173 4,679 35,259 1,09 37.7? C 

40 ATCM 2389 N THR A 288 45.904 0.992 34.152 1.00 22.27 U 

ATCM 2390 CA THR A 2 68 45.428 -0.152 34.956 1,99 19.34 £ 

ATCM 2391 C THR A 288 46.561 -1. 177 35.227 1.00 27.47 C 

atcm 2392 o THR A, 299 46.779 -1.596 36.365 1.99 24.97 2 

ATOM 2393 CB THR A 288 44.288 -0.909 34.244 1.00 22.86 £ 

45 ATCM 2394 OG1 THR A 268 13,129 -9,096 34.196 l,po 24,94 £ 
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AXCM 2395 CG2 THR A 288 43.916 -2.113 35.024 1.00 25.08 £ 

ATCM 2396 H ARG A 299 47.29P -1.595 31.179 l.QP 26-09 H 

AXSM 2397 CA ARC A 299 18.^29 -2.506 31,31? 1,00 16,9? c 

AXCM 23_SS £ ARG A 289 49.405 -2.037 3 5.408 1.00 22.96 £ 

5 AXOl 2399 O ARG A 2B9 49.847 -2.790 36.275 1.00 23.03 O 

AXCM 2400 CB ARG A 289 49.208 -2.607 32.976 1.00 12.43 £ 

ATOM 2401 CG ARG A 289 48.934 -3.804 32.103 1.00 29.39 £ 

AXCM 2102 CP ARG ft 289 50.016 -1.102 31.037 1,90 25,88 £ 

ATOtt 2403 NE ARG A 289 49.441 -4.996 30.020 1,PQ 17,26 N. 

10 ATOM 2404 CZ ARG A 289 50.053 -5.459 28.930 1.00 38.82 £ 

ATOM 2405 NH1 ARG A 289 51.306 -5.153 28.660 1.00 13.51 ft 

ATCM 2406 NH2 ARG A 289 49.400 -6. 262 28.09 6 1.00 37.68 H 

AT/CM 2107 H LEU A 290 49.815 -0.786 35.306 1.00 26.60 

ATCM 2108 CA LBV A 290 50.909 -Q.254 36.219 1 T QQ 25,12 £ 

15 ATOM 2409 C LEU A 290 50.324 -0.376 37.656 1.00 24.17 C 

AXCM 2110 O LEU A 290 51.072 -0.759 38.574 1.00 19.94 Q 

ATCM 2111 CB LEV A 29Q 51.000 1.219 35.876 1.00 24.66 £ 

AXCM 2412 CG LEU A 290 52.281 2.019 36.066 1.0O 24.67 £ 

AXCM 211? CP1 LEV A 29Q 51.992 3.479 36.504 1.00 29.25 £ 

20 AXCH 2414 CD2 LEU A 290 53.450 1.335 36.788 1.00 15.82 £ 

A TPM 2415 N HIS A 291 49.093 0.075 37.868 1.00 30.10 N. 

AXCM 2115 CA His A 291 19.513 Q T P71 39.212 1 ,PQ 31 . 17 £ 

AXCM 2117 c HIS A 291 19.111 -1.367 39.730 1,PQ 13.41 £ 

AXCM 2_US Q HIS A 291 48.621 -1.654 40.929 1.00 38.81 O 

25 Aiai 2119 CB HIS A 291 17.113 0,574 39.113 l.PO 28.01 £ 

ATCM 2420 CG HIS A 291 47.097 2.153N 38.9B4 1.00 29.68 £ 

AXCM 2421 NDl HIS A 291 48.242 2.921 39.015 1.00 35.63 N 

AXSM 2422 CD2 HIS A 291 46.068 3.024 38.855 1.00 31.18 £ 

ATCM 2423 CE1 HIS A 291 47.926 4.197 38.845 1.00 24.20 £ 

30 AXCM 2424 NE2 HIS A 291 46.612 1.269 39,747 1,QQ $1.9? N 

AXCM 2125 W GLN A 292 48.048 -2.260 38.821 1.00 30.71 N. 

AXCM 2426 CA gLN A 292 47.950 -3.654 39.181 1.00 34.82 £ 

ATOM. 2427 C GLH A 292 49.287 -4.197 39.622 1.00 36.93 C 

AXCM 2429 Q. GLN A 292 49.323 -5.040 40.510 l.OO 27.56 O 

35 AXfiM 2429 CB GLN A 292 47.322 -4.487 38.069 1.00 28.23 C 

AXCM 2430 CG gln a 292 45,798 -1,105 39.171 1.00 91.15 £ 

AXCM 2431 CD GLN A 292 45.023 -4.954 36.963 1.00100.00 £ 

AXCM 2432 QE1 SLN A 292 45.597 -5,410 35,951 1,QQ 99,55 o 

AXCM 2433 NE2 GLN A 292 43.687 -4.895 37.073 1.00 40.86 N 

40 AXCM 2434 N LEU A 293 50.375 -3.658 39.058 1.00 31.75 H 

AXCM 2435 CA LEU A 293 51.750 -4.072 39.383 1.00 22.67 £ 

AXCM 2436 £ LEU A 293 52,239 -3.323 40,613 1-00 29,64 £ 

AXCM 213J Q LEU A 293 53.420 -3.377 41.017 1.00 22.27 Q 

AXCM 2438 CB LEV A 293 52.665 -3,769 39.205 l.Qp 25.57 £ 

45 AXCH 2439 CG LEU A 293 52.497 -4.703 37 .016 1.00 35.11 C 
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ATOM 2440 CDl LEU A 293 53,30$ -4,170 35.83$ 1 T QQ 29,25 £ 

ATOH 2441 CD2IJ5UA 293 52,965 ~6tU9 37,439 1,99 47,91 £ 

2442 h gly A 294 51,316 -2,519 41,111 1,QP 33,08. H 

ATOM 2443 CA GLY A 294 51.488 -1.793 42.347 1.00 24.90 £ 

5 ATCM 2444 C GLY A 294 52.272 -0.512 42.326 1.00 29.31 £ 

ATCM 2445 O GLY A 294 53.070 -0.249 43.223 1.00 25.25 Q 

ATOM 2446 N TRP A 295 52.000 0.347 41.368 1.00 27.83 H 

ATCM 2447 CA TRP A 295 52.687 1.623 41,385 l.PP 19,45 £ 

ATCM 2448 C TRP A 295 51.684 2.731 41.081 1,00 25,79 £ 

10 ATCM 2449 O TRP A 295 50.765 2.527 40.297 1.00 20.43 Q 

ATCM 2450 CB TRP A 295 53.961 1.614 40.524 1.00 12.85 £ 

ATCM 2451 CG TRP A 295 54.750 2.911 40.618 1,QQ 23.94 £ 

ATCM 2452 CDl TRP A 295 55.897 3.161 41.368 1.00 23.68 £ 

atcm 2453 CD2 trp a 295 54.415 4,159 39.979 IrQQ 20,72 £ 

15 ATCM 2454 HEl TRP A 295 56,258 4,493 41,244 l-QO 18.67 H 

ATCM 2455 CE2 TRP A 295 55.389 5.113 49.373 1,99 20,95 £ 

A2£tJ 2456 CE3 TRP A 295 53.4P6 4-55Q 39.1Q2 1,PP 21,47 £ 

atcm 2457 cz2 TRP A 295 55,338 6.439 39.958 l.PP 17,58 £ 

ATCM 2458 CZ3 TRP A 295 53.403 5.873 38.632 1.00 21.57 £ 

20 ATCM 2459 CH2 TRP A 295 54.368 6.787 39.058 1.00 19.45 £ 

ATCM 2460 N TYR A 296 51.709 3 .797 41.884 1.00 25.17 H 

atcm 2461 ca TYR a 296 50 .7 2 9 4.883 41.731 l.PQ 24, 9Q £ 

ATOM 2462 C TYR A 296 51,517 6.178 41,857 IrPP 39,95 £ 

ATCM 2463 O TYR A 296 52.363 6.272 42,745 1-0P 21, 27 O 

25 ATCM 2464 CB TYR A 296 49.654 4,813 42.84P 1,99 25.18 £ 

ATOM 24 65 CG TYR A 296 48.685 3.651 42.744 1-QP 23,Q4 £ 

ATCM 2466 CDl TYR A 296 49-078 g T 343 43 T Q88 1-P0 31. $2 £ 

ATCM 2467 CD2 TYR A 296 47,389 3,853 42.289 lrPP 26,92 c 

atcm 2468 eel tyr a 296 49.293 1.268 42,935 1,99 24.42 £ 

30 ATOM 2469 CE2 TYR A 296 46,49? 2,770 42,127 1 T QP 24-81 £ 

atcm 2470 cz tyr a 296 46,992 1.483 42.464 1.99 39,41 £ 

atcm 2471 oh tyr a 296 45,994 9,434 42,337 1-QQ 66,19 S 

ATCM 2472 N HIS A 297 51.324 7.123 40.924 1.00 20.95 fi 

atcm 2473 ca his a 297 52.139 9.343 49.938 1,99 26.86 £ 

35 ATCM 2474 C HIS A 297 51.947 9.175 42.210 1.00 35.01 C 

ATCM 2475 O HIS A 297 50.885 9.132 42.874 1.00 26.92 O 

ATCM 2476 CB HIS A 297 51,919 9,192 39,733 l t 99 25,77 £ 

ATOM 2477 CG HIS A 297 59,489 9.842 39.803 1.99 31.16 £ 

ATCM 2478 MD1 HIS A 297 49.314 9.145 39.633 l f P0 34,21 H 

40 ATCM 2479 CD2 HIS A 297 50.135 11.094 40.167 1.00 25.83 £ 

ATCM 2480 CE1 HIS A 297 48.290 9.972 39.776 1.00 24.14 £ 

Aigtt 2481 «E2 His A 2?7 48.761 11.164 49.997 1,99 23,35 n 

atcm 2482 m glu a 298 52.983 9.926 42.554 l.QQ 24.98 U 

ATOM 2483 CA GLU A 298 52,957 19,6?? 43.799 1-99 27.65 £ 

45 ATCM 2484 C GLU A 298 52,831 12,197 43.741 1t99 36.86 £ 
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ATOM 2195 P gLV A 239 52,433 12,792 44.718 l.oo 43.61 Q 

AISM 2195 CB gLV A 299 51.153 IP. 31? 11.695 1,90 22,92 £ 

AiaJ 2187 CG GLV A 299 54.094 8.943 45.285 1.00 36.42 £ 

ATOM 248B CD GLU A 298 54.999 6.664 46.406 1.00100.00 £ 

5 Alas 2489 OBI GLU A 298 55.223 8.561 46.152 1.00 44.79 O 

ATCM 2490 OE2 GLU A 298 54.526 8.470 47.547 1.00100.00 Q 

AX£U 2191 W I LB A 299 53.232 12.800 42.639 1.00 23.49 H 

AXCM 2492 CA ILE A 299 53.268 14.244 42.562 1.00 13.25 £ 

ATCM 2193 C I LB A 299 52.016 14.848 41.906 1.00 27.05 £ 

10 AEM 2191 Q IMS A 299 51.681 14. 530 40.757 1.00 26.73 2 

AISM 2495 CB I LB A 299 54.586 14.711 41.862 1.00 15.93 C 

AXfiM. 2496 CGI ILK A 299 55.836 14.183 42.606 1.00 23.83 £ 

ATOM 2497 Cg2 ILE A 299 54.596 16.213 41.541 1.00 17.37 £ 

ATCM 2499 CP1 ILE A 299 57.232 14.221 41.787 1.00 21.32 C 

15 ATCM 2499 H ?SR A 300 51.323 15.716 42.64 8 1.00 18.55 

ATCM 25QQ CA SBR A 309 50.177 16.449 42.091 1.00 19.58 £ 

ATCM 2591 C SBR A 399 50.714 17. 415 41.042 1.00 17.29 £ 

AX£W 2592 P SBR A 3Q0 51.824 17.941 41.178 1.00 21.06 Q 

ATCM 2593 CP gfiR A 390 49.542 17.307 43.181 1.00 16. 7B £ 

20 AJCM. 2504 OG SER A 300 50.548 17.969 43.923 1.00 75.80 Q 

AXCU 2595 N USD A 301 49.870 17.755 40.075 1.00 16.13 H 

AEM 2595 CA LBV A 391 50.246 18.675 39.014 1.00 17.70 £ 

ATCM 2597 C LBV A 391 50.689 19.964 39.646 1.00 20.11 £ 

ATCM 2508 Q LEU A 301 51.714 20.568 39.303 1.00 20.46 Q 

25 ATCM 2509 CB LEU A 301 48.990 18.981 38.197 1.00 17.92 £ 

ATCM 2510 CG LEU A 301 49.182 20. 030 37.112 1.00 25.15 £ 

ATCM 2511 CP1 LBV A 391 50.233 19.552 36.086 1.00 18.82 C 

AICU 2512 CP2 LEV A 391 47.854 20.177 36.436 1.00 25.88 £ 

ATCM 2513 N GLU A 302 49.845 20.398 40.554 1.00 27.01 N 

30 ajjch 2514 CA gLV A 392 59.953 21,636 41.299 1,99 37.72 c 

AICU 2515 C gLV A, 392 51.410 21.618 41.996 1.00 29.99 £ 

AISW 2515 P GLV A 392 52.245 22.514 41.798 1.00 27.15 Q 

ATSW 2517 CB GLV A 39? 48.899 21.841 42.275 1.00 43.10 £ 

AlfiH 25JJ CG GLU A 302 49 .061 23.061 43.174 1.00 90.85 £ 

35 AICU 25;? CP gLV A 392 48.451 24.324 42.580 1.00100.00 £ 

ATCM 2529 C-Bl gLV A 392 47.566 24.209 41.706 1.00100.00 Q 

AICU 2521 OE2 GLU A 302 48.808 25.432 43.036 1.00 64.50 O 

AISU 2522 W ALA A 303 51.646 20.591 42 .801 1.00 8.72 ft 

AICU 2523 CA ALA A 303 52.937 20.455 43.459 1.00 15.03 £ 

40 AICU 2524 C ALA A 303 54.102 20.355 42.450 1.00 19.85 £ 

AX£M 2525 P ALA A 393 55.104 21.090 42.553 1.00 22.24 Q 

Aiaj 2525 CB ALA A 3Q3 52.938 19.258 44.410 1.00 18.97 £ 

AICU 2527 M gLY. A 391 53.953 19.472 41.467 1.00 13.05 g 

AICU 2528 ca gLY A 394 54,979 19.321 19.419 1-QQ 9,94 £ 

45 AJCU 2529 C GLY A 304 55.239 20.621 39.695 1.00 20.31 C 
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ATOM 2530 O GLY A 304 56.394 2P .9QQ 39,322 1,99 14,30 Q 

ATCM 2531 N LEV A 305 51.191 21.393 39,361 l r QQ 19,76 H 

AIOi 253? CA LEV A 3g5 54,4,9? 22. 622 39,611 1,99 20.29 £ 

ATCM 2533 C LEU A 305 55.281 23.669 39.456 1.00 28.92 £ 

5 ATOM 2534 O LEU A 305 56.194 24 .385 38.974 1.00 17.69 Q 

ATCM 2535 CB LEU A 305 53.202 23.245 39,93? 1.00 24.03 £ 

ATCM 2536 CG LEU A 305 52.357 22.647 36.880 1.00 27.66 £ 

ATOM 2537 CD1 LEU A 305 50.975 23.384 36.799 l.QQ 13,44 £ 

ATOM 2538 CD2 LEU A 305 53.079 22.724 35,513 1,99 19.39 £ 

10 ATCM 2539 N ALA A 306 54.904 23.757 40.724 1.00 19.94 if 

AlfiH 2540 CA ALA A 306 55 r 544 34,669 41,655 1-QQ 24,79 £ 

ATCM 2541 C ALA A 306 57.035 24 .380 41.743 1.00 27.51 £ 

ATCM 2542 O ALA A 306 57.852 25.280 41.662 1,QQ 29,68 Q 

ATCM 2543 CB ALA A 306 54.937 24.471 43.992 1,0Q 17.97 £ 

15 atcm 2544 N sera 307 57.379 23.137 42.911 l.QQ 19,46 H 

ATCM 2545 CA SER A 307 58.793 22.756 42.162 1,QP 16.31 £ 

ATCM 2546 C SER A 307 59.547 22.885 40.832 1.00 22.66 £ 

ATCM 2547 O SER A 307 60.742 23.212 19,786 1,QQ 28,47 Q 

AEM 2548 cb ser a 307 59,951 21r3<>4 42.622 1.99 20,47 £ 

20 atcm 2549 OG ser a 307 59,517 29.454 41.526 1,99 29. Q3 Q 

ATCM 2550 N THR A 308 58.849 22.631 39.735 1.00 27.31 M 

ATCM 255^ ca THR a, 399 59.459 22,739 39,413 1,99 2?, 8? £ 

ATCM 2552 C THR A 308 59.757 24.216 38.107 1,99 26,96 £ 

ATCM 2553 O THR A 308 69,919 24.546 37,591 l,po 29 T 99 £ 

25 ATCM 2554 CB THR A 308 58.536 22.115 37.318 l t PQ 19,72 £ 

Aicu 2555 OGi thr A 308 59.356 29.714 37.545 1,99 29.17 Q 

ATCM 2556 CG2 THR A 308 59.094 22.330 35.923 1-00 12,37 £ 

ATCM 2557 N TYR A 309 58.846 25.118 39,453 1,99 29.2P E 

ATCM 2558 CA TYR A 309 59.110 26.549 38.241 1.00 31.09 C 

30 ATCM 2559 C TYR A 309 60.383 27 .059 39.045 1.00 16.31 C 

ATCM 25 60 O TYR A 309 61.179 27.858 38.577 1.00 16.91 O 

ATCM 2561 CB TYR A 309 57.819 27.373 38.533 1,99 31.19 £ 

atcm 2562 cg tyr a 309 57.944 29.995 39.392 1t99 14.57 £ 

ATCM 2563 CD1 TYR A 309 58.397 29.457 37.224 1.99 17.51 £ 

35 ATCM 2564 CD2 TYR A 309 57.575 29.757 39.442 1.00 24.99 £ 

ATCM 2565 CE1 TYR A 309 58.527 30 .801 37.100 1.00 18.41 C 

AT CW 25 96 CB2 TYR A 3Q9 57,744 31.129 39.351 1,99 19,94 £ 

ATCM 2567 CZ TYR A 309 58.212 31.641 38.164 1,09 29.13 C 

ATCM 2568 OH TYR A 309 58.300 33.004 37.966 1 T 0Q 29,22 Q 

40 ATOM 2569 H GLN A 310 60.560 26 .579 40.260 1,99 15.41 U 

ATCM 2570 CA GLN A 310 61.705 26 .964 41.087 l.QQ 22,35 £ 

ATCM 2571 C GLN A 310 63.001 26.492 40.446 1,99 31.46 £ 

ATCM 2572 O GLN A 310 64.009 27.191 40.442 1, 99 3 3.42 £ 

ATCM 2573 CB GLN A 310 61.587 26.335 42.482 1.00 17.67 C 

45 ATOM 2574 CG GLN A 310 62.579 26.921 43.461 1.00 57.58 C 
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AX£ti 2575 CD GLN A 310 62.287 28.370 43.762 1.00 65.14 £ 

ATOM 2576 OE1 GLN A 310 61.134 28.754 44.000 1.00 41.94 £ 

AICM 2577 NE2 SLN A 31Q 63.330 29.194 43.801 1.00 99.09 H 

AX£M 2579 W TRP A 311 62.957 25.321 39.830 1.00 28.76 U 

5 AISM 2579 CA TRP A 311 64.146 24.822 39.163 1.00 26.29 £ 

Aiga 2599 C TRP A 311 64.474 25.769 38.040 1.00 17.91 £ 

AXai 2581 O TRP A 311 65.599 26.193 37.880 1.00 22.89 O 

AXSE 2582 « TRP A 311 63.938 23.383 38.643 1.00 27.53 £ 

AI£M 2583 CG TRP A 3U 65.176 22.784 38.119 1.00 17.82 £ 

10 AlfiM 2584 CP1 TRP A 311 66.132 22.090 38.826 1.00 20.21 £ 

AlfiM 2595 £02 TRF A 311 65.652 22.8B1 36.784 1.00 17.99 £ 

Ami 2596 WEI TRP A 311 67.197 21.776 37.992 1.00 20.39 H 

ATOM 2587 CE2 TRP A 311 66.933 22.284 36.746 1.00 19.57 £ 

AX21 2599 CE3 TRP A 311 65.141 23.461 35.621 1.00 20.26 £ 

15 ATCM 2589 CZ2 TRP A 311 67.686 22.236 35.599 1.00 14.25 £ 

AXStf 2590 £g3 TRP A 311 65.901 23.446 34.501 1.00 18.59 £ 

AlStf 2591 CH2 TRP A 311 67.169 22.831 34.494 1.00 16.86 £ 

Aiai 2592 W PHB A 312 63.469 26.109 37.25 6 1.00 17.47 H 

ATSM 259? CA PHB A 312 63.665 27.064 36.17 9 1.00 20.14 £ 

20 ATOi 2594 C PHB A 312 64.224 28. 371 36.733 1.00 18.33 £ 

AXCM 2595 Q PHE A 312 65.080 29.024 36.104 1.00 24.76 Q 

AXSH 2596 £B PHE A 312 62.328 27.318 35.458 1.00 29.51 £ 

ATCM 2597 CG PHB A 312 62.328 28.544 34.603 1.00 28.52 £ 

AZfitf 259P cpi PHE A 31? 52.993 28.508 33.338 1.00 30.53 £ 

25 Ann 2599 CP2 PHE A 312 €1,925 29 T 759 35.194 1 .Q9 2 9 .31 £ 

ATCM 26QQ CEl PHE A 312 62.936 29.660 32.554 1.00 34.73 £ 

ATCM 2601 CE2 PHE A 312 61.900 30.904 34.36 2 1.00 38.40 £ 

ATCM 2602 CZ PHB A 312 62.432 30.860 33.063 1.00 40.73 £ 

AISM 2603 N LEU A 313 63.697 28.787 37.876 1.00 22.46 H 

30 Aiai 2694 CA LBV A 313 64.170 30.025 38.516 1.00 28.47 £ 

ATCM 2605 C LEU A 313 65.627 29.827 38.898 1.00 37.53 C 

AISM 2996 q LEV A 313 66,452 39.693 39.62? 1,99 34. 2Q Q 

ATCM 2607 Cg LEV A 313 63.375 30.410 39.783 1.00 20.44 £ 

AXCM 26Q9 CG LEV A 313 61.955 30.897 39.555 1.00 16.29 £ 

35 A TCM 2609 CD1 LEU A 313 61.499 31.399 40.871 1.00 15.94 £ 

AXCM 261Q CP2 LEV A 313 61.959 31.961 38.524 1.00 14.44 C 

Aiai 2611 N GLU A 314 65.953 28.685 39. SOB 1.00 30.70 H 

ATSM 2612 CA GLU A 314 67.353 2B.432 39.875 1.00 24.15 £ 

AICU 2913 C CLV A 314 68.291 28.149 38.703 1.00 36.34 £ 

40 ATCM 2611 9 5LV A 314 69.485 28.047 38.890 1.00 43.10 Q 

ATCM 2615 CP GLU & 314 67.459 27.366 40.947 1.00 19.90 £ 

AICM 2616 CG GLU A 314 66.634 27.754 42.141 1.00 27.37 £ 

ATSM 2617 CP GLU ft ?14 66.450 26.666 43.1B2 1.00 31.09 £ 

ATCM 2619 QE1 ?LV A 314 97.157 25.949 43.995 1.09 59,69 Q 

45 AJSM 2619 9E2 CLV A 314 65.634 26.872 44.125 1.00 46.20 Q 
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While various embodiments of the present invention have been described in detail, 
it is apparent that modifications and adaptations of those embodiments will occur to those 
skilled in the art. However, it is to be expressly understood that such modifications and 
adaptations are within the spirit and scope of the present invention, as set forth in the 
5 following claims. 
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What is claimed: 

1 . A method for producing ascorbic acid or esters thereof in a microorganism, 
comprising cuhuring a microorganism having a genetic modification to increase the action 
of an enzyme selected from the group consisting of hexokinase, glucose phosphate 

5 isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose 
pyrophosphorylase, GDP-D-mannose GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1 -P-phosphatase, L-galactose dehydrogenase, and L- 
galactono-Y-lactone dehydrogenase; and recovering said ascorbic acid or esters thereof. 

2. A method, as claimed in Claim 1, wherein said genetic modification is a 
0 genetic modification to increase the action of an enzyme selected from the group 

consisting of GDP-D-mannose: GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1 -P-phosphatase, L-galactose dehydrogenase, and L- 
galactono-y-lactone dehydrogenase. 

3. A method, as claimed in Claim 1, wherein said genetic modification is a 
5 genetic modification to increase the action of an epimerase that catalyzes conversion of 

GDP-D-mannose to GDP-L-galactose. 

4. A method, as claimed in Claim 3, wherein said genetic modification is a 
genetic modification to increase the action of GDP-D-mannose : GDP-L-galactose 
epimerase. 

0 5. The method of Claim 3, wherein said genetic modification comprises 

transformation of said microorganism with a recombinant nucleic acid molecule that 
expresses said epimerase. 

6. The method of Claim 5, wherein said epimerase has a tertiary structure that 
substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 

5 epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code lbws. 

7. The method of Claim 5, wherein said epimerase has a structure having an 
average root mean square deviation of less than about 2.5 A over at least about 25% of 
Ccc positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 

0 reductase represented by atomic coordinates having Brookhaven Protein Data Bank 
Accession Code lbws. 
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8. The method of Claim 5, wherein said epimerase has a tertiary structure 
having an average root mean square deviation of less than about 1 A over at least about 
25% of Ca positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 

5 Bank Accession Code lows. 

9. The method of Claim 5, wherein said epimerase comprises a substrate 
binding site having a tertiary structure that substantially conforms to the tertiary structure 
of the substrate binding site of a GDP-4-ktfo-6-deoxy-D-mannose epimerase/reductase 
represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 

10 lbws. 

10. The method of Claim 9, wherein said substrate binding site has a tertiary 
structure with an average root mean square deviation of less than about 2.5 A over at least 
about 25% of Ca positions of the tertiary structure of a substrate binding site of a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having 

15 Brookhaven Protein Data Bank Accession Code lbws. 

1 1 . The method of Claim 5, wherein said epimerase comprises a catalytic site 
having a tertiary structure that substantially conforms to the tertiary structure of the 
catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by 
atomic coordinates having Brookhaven Protein Data Bank Accession Code lbws. 

20 12. The method of Claim 1 1, wherein said catalytic site has a tertiary structure 

with an average root mean square deviation of less than about 2.5 A over at least about 
25% of Ca positions of the tertiary structure of a catalytic site of a GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven 
Protein Data Bank Accession Code lbws. 

25 13. The method of Claim 1 1 , wherein said catalytic site comprises the amino 

acid residues serine, tyrosine and lysine. 

14. The method of Claim 13, wherein tertiary structure positions of said amino 
acid residues serine, tyrosine and lysine substantially conform to tertiary structure 
positions of residues Serl07, Tyrl36 and Lysl40, respectively, as represented by atomic 

30 coordinates in Brookhaven Protein Data Bank Accession Code lbws. 

15. The method of Claim 5, wherein said epimerase binds NADPH. 
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16. The method of Claim 5, wherein said epimerase comprises an amino acid 
sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 
amino acid residues in said amino acid sequence align with 100% identity with at least 
about 50% of non-Xaa residues in SEQ ID NO: 1 1 . 
5 17. The method of Claim 5, wherein said epimerase comprises an amino acid 

sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 
amino acid residues in said amino acid sequence align with 100% identity with at least 
about 75% of non-Xaa residues in SEQ ID NO: 1 1 . 

18. The method of Claim 5, wherein said epimerase comprises an amino acid 
1 0 sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 

amino acid residues in said amino acid sequence align with 100% identity with at least 
about 90% of non-Xaa residues in SEQ ID NO: 1 1 . 

19. The method of Claim 5, wherein said epimerase comprises an amino acid 
sequence having at least 4 contiguous amino acid residues that are 100% identical to at 

1 5 least 4 contiguous amino acid residues of an amino acid sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID 
NO: 10. 

20. The method of Claim 5, wherein said recombinant nucleic acid molecule 
comprises a nucleic acid sequence comprising at least about 12 contiguous nucleotides 

20 having 100% identity with at least about 12 contiguous nucleotides of a nucleic acid 
sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9. 

2 1 . The method of Claim 5, wherein said epimerase comprises an amino acid 
sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly . 

22. The method of Claim 5, wherein said recombinant nucleic acid molecule 
comprises a nucleic acid sequence that is at least about 15% identical to a nucleic acid 
sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO: 7 and SEQ ID NO: 9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard default parameters. 

23 . The method of Claim 5, wherein said recombinant nucleic acid molecule 
comprises a nucleic acid sequence that is at least about 20% identical to a nucleic acid 
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sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard default parameters. 

24. The method of Claim 5, wherein said recombinant nucleic acid molecule 
5 comprises a nucleic acid sequence that is at least about 25% identical to a nucleic acid 

sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard default parameters. 

25. The method of Claim 5, wherein said recombinant nucleic acid molecule 
0 comprises a nucleic acid sequence that hybridizes under stringent hybridization conditions 

to a nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epim erase/ 
reductase. 

26. The method of Claim 25, wherein said nucleic acid sequence encoding said 
GDP-4-keto-6-deoxy-D-mannose epimerase/ reductase is selected from the group 

5 consisting of SEQ ID NO: 1, SEQ ID NO:3 and SEQ ID NO:5. 

27. The method of Claim 25, wherein said GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase comprises an amino acid sequence selected from the group consisting 
of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6. 

28. A method, as claimed in Claim 1, wherein said microorganism is selected 
3 from the group consisting of bacteria, fungi and microalgae. 

29. A method, as claimed in Claim 1, wherein said microorganism is acid- 
tolerant. 

30. A method, as claimed in Claim 1, wherein said microorganism is a 
bacterium. 

5 3 1 . A method, as claimed in Claim 30, wherein said bacterium is selected from 

the group consisting of Azotobacter and Pseudomonas 

32. A method, as claimed in Claim 1, wherein said microorganism is a fungus. 

33 . A method, as claimed in Claim 32, wherein said microorganism is a yeast. 

34. A method, as claimed in Claim 33, wherein said yeast is selected from the 
3 group consisting of Saccharomyces yeast. 
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35. A method, as claimed in Claim 1, wherein said microorganism is a 
microalga. 

36. A method, as claimed in Claim 3 5, wherein said microalga is selected from 
the group consisting of microalgae of the genera Prototheca and Chlorella. 

5 37. A method, as claimed in Claim 36, wherein said microalga is selected from 

the genus Prototheca. 

38. A method, as claimed in Claim 1, wherein said microorganism further 
comprises a genetic modification to decrease the action of an enzyme having GDP-D- 
mannose as a substrate, other than GDP-D-mannose:GDP-L-galactose epimerase. 
10 39. A method, as claimed in Claim 38, wherein said genetic modification to 

decrease the action of an enzyme having GDP-D-mannose as a substrate, other than GDP- 
D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action 
of GDP -D-manno se-dehydrogenase. 

40. A method, as claimed in Claim 1, wherein said microorganism is acid- 
tolerant and said step of culturing is conducted at a pH of less than about 6.0. 

41. A method, as claimed in Claim 1, wherein said microorganism is acid- 
tolerant and said step of culturing is conducted at a pH of less than about 5.5. 

42. A method, as claimed in Claim 1, wherein said microorganism is acid- 
tolerant and said step of culturing is conducted at a pH of less than about 5.0. 

43. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that is magnesium (Mg) limited. 

44. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that is Mg limited during a cell growth phase. 

45. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that comprises less than about 0.5 g/L of Mg during 
a cell growth phase. 

46. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a rermentation medium that comprises less than about 0.2 g/L of Mg during 
a cell growth phase. 
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47. A method, as claimed in Claim 1, wherein said step of cuhuring is 
conducted in a fomentation medium that comprises less than about 0. 1 g/L of Mg during 
a cell growth phase. 

48. A method, as claimed in Claim 1, wherein said step of culturing is 
5 conducted in a fermentation medium that comprises a carbon source other than D- 

mannose. 

49. A method, as claimed in Claim 1, wherein said step of cuhuring is 
conducted in a fermentation medium that comprises glucose as a carbon source. 

50. A microorganism for producing ascorbic acid or esters thereof, wherein 
10 said microorganism has a genetic modification to increase the action of an enzyme 

selected from the group consisting of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, plwsphonianriorautase, GDP-D-marmose pyrophosphorylase, 
GDP-D-mannose : GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
galactose- 1 -P-phosphatase, L-galactose dehydrogenase, and L-galactono- y -lactone 
15 dehydrogenase. 

51. A microorganism, as claimed in Claim 50, wherein said genetic 
modification is a genetic modification to increase the action of an enzyme selected from 
the group consisting of GDP-D-mannose : GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1 -P-phosphatase, L-galactose dehydrogenase, and L- 

20 galactono-y -lactone dehydrogenase. 

52. A microorganism, as claimed in Claim 50, wherein said genetic 
modification is a genetic modification to increase the action of GDP-D-mannose GDP-L- 
galactose epimerase. 

53. A microorganism, as claimed in Claim 50, wherein said microorganism has 
25 been genetically modified to express a recombinant nucleic acid molecule encoding an 

epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein 
said epimerase has a tertiary structure having an average root mean square deviation of 
less than about 2.5 A over at least about 25% of Cct positions of the tertiary structure of 
a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic 
30 coordinates having Brookhaven Protein Data Bank Accession Code lbws 
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54. A microorganism, as claimed in Claim 50, wherein said microorganism is 
selected from the group consisting of bacteria, fungi and microalgae. 

55. A rnkroorganisni, as claimed in Claim 50, wherein said microorganism is 
a bacterium. 

5 56. A microorganism, as claimed in Claim 55, wherein said bacterium is 

selected from the group consisting of Azotobacter and Pseudomonas 

57. A microorganism, as claimed in Claim 50, wherein said microorganism is 
a fungus. 

58. A microorganism, as claimed in Claim 57, wherein said microorganism is 

10 a yeast. 

59. A microorganism, as claimed in Claim 58, wherein said yeast is selected 
from the group consisting of Saccharomyces yeast. 

60. A plant for producing ascorbic acid or esters thereof, wherein said plant 
has a genetic modification to increase the action of an enzyme selected from the group 

15 consisting of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, 
phosphomannornutase, GDP-D-mannose pyrophosphorylase, GDP-D-mannose: GDP -L- 
galactose epimerase, GDP-L-galactose phosphorylase, L~galactose- 1 -P-phosphatase, L- 
galactose dehydrogenase, and L-galactono- y -lactone dehydrogenase. 

61. A plant, as claimed in Claim 60, wherein said genetic modification is a 
20 genetic modification to increase the action of an enzyme selected from the group 

consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose-l-P-phosphatase, L-galactose dehydrogenase, and L- 
galactono-y -lactone dehydrogenase. 

62. A plant, as claimed in Claim 60, wherein said genetic modification is a 
25 genetic modification to increase the action of GDP-D-mannose GDP-L-galactose 

epimerase. 

63. A plant, as claimed in Claim 60, wherein said plant has been genetically 
modified to express a recombinant nucleic acid molecule encoding an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein said epimerase 

30 has a tertiary structure having an average root mean square deviation of less than about 
2 5 A over at least about 25% of Ccc positions of the tertiary structure of a GDP-4-keto-6- 
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deoxy-D-mannose epimerase/reductase represented by atomic coordinates having 
Brookhaven Protein Data Bank Accession Code lbws. 

64. A plant, as claimed in Claim 60, wherein said plant further comprises a 
genetic modification to decrease the action of an enzyme having GDP-D-mannose as a 

5 substrate other than GDP -D-mannose: GDP-L-gal actose epimerase. 

65. A plant, as claimed in Claim 60, wherein said generic modification to 
decrease the action of an enzyme having GDP-D-mannose as a substrate other than GDP- 
D-irtannose:GDP-L-galactose epimerase is a genetic modification to decrease the action 
of GDP-D-mannose-dehydrogenase. 

10 66. A plant, as claimed in Claim 60, wherein said plant is a microalga. 

67. A plant, as claimed in Claim 66, wherein said plant is selected from the 
group consisting of microalgae of the genera Prototheca and Chlorella. 

68. A plant, as claimed in Claim 66, wherein said microalga is selected from 
the genus Prototheca. 

15 69. A plant, as claimed in Claim 60, wherein said plant is a higher plant. 

70. A plant, as claimed in Claim 60, wherein said plant is a consumable higher 

plant. 

71 . A microorganism for producing ascorbic acid or esters thereof, wherein 
said microorganism has been genetically modified to express a recombinant nucleic acid 

20 molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, wherein said epimerase comprises an amino acid sequence that aligns with SEQ 
ID NO:l 1 using a CLUSTAL alignment program, wherein amino acid residues in said 
amino acid sequence align with 100% identity with at least about 50% of non-Xaa 
residues in SEQ ID NO:l 1. 

25 72. A plant for producing ascorbic acid or esters thereof; wherein said plant 

has been genetically modified to express a recombinant nucleic acid molecule encoding 
an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-gal actose, wherein 
said epimerase comprises an amino acid sequence that aligns with SEQ ID NO: 1 1 using 
a CLUSTAL alignment program, wherein amino acid residues in said amino acid sequence 

30 align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO: 1 1 
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SEQUENCE LISTING 
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<110> Berry, Alan 

Running, Jeffrey A. 
Severson, David K. 
Burlingame, Richard P. 

<120> "VITAMIN C PRODUCTION IN MICROORGANISMS AND PLANTS" 

<130> 3161-24-PCT 

<140> not yet assigned 
<141> 1999-05-25 

<150> 60/125,073 
<151> 1999-03-17 

<150> 60/125,054 
<151> 1999-03-18 

<150> 60/088,549 
<151> 1998-06-08 

<160> 15 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 1583 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (49) . . (993) 

<400> 1 

tagtctttaa tttcgcagcg tttttataat tgtgcagagg tttcgtcc atg tct gac 57 

Met Ser Asp 
1 

aaa tct gcc aaa ate ttc gtc gcg ggt cat cgt ggt ttg gtt gga tct 105 
Lys Ser Ala Lys lie Phe Val Ala Gly His Arg Gly Leu Val Gly Ser 
5 10 15 

gcc att gtc cgc aag ctt cag gaa caa ggt ttc acc aat etc gtt ctt 153 
Ala lie Val Arg Lys Leu Gin Glu Gin Gly Phe Thr Asn Leu Val Leu 
20 25 30 35 
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aaa aca cac gcc gag ctt gat etc act cgt caa gec gat gtt gaa tec 201 
Lys Thr His Ala Glu Leu Asp Leu Thr Arg Gin Ala Asp Val Glu Ser 
40 45 50 



ttc ttt tct caa gag aag cca gtt tat gta ate eta gca gca get aaa 249 
Phe Phe Ser Gin Glu Lys Pro Val Tyr Val He Leu Ala Ala Ala Lys 
55 60 65 



gtt ggt ggt att cac get aac aac acc tat cct get gat ttc att ggt 297 
Val Gly Gly He His Ala Asn Asn Thr Tyr Pro Ala Asp Phe lie Gly 
70 75 80 



gtc aat etc cag att cag acc aat gtg ate cac tct gca tat gag cac 345 
Val Asn Leu Gin He Gin Thr Asn Val lie His Ser Ala Tyr Glu His 
85 90 95 



ggt gtg aag aag ctt etc ttc ctt gga tea tec tgc att tac cct aaa 393 
Gly Val Lys Lys Leu Leu Phe Leu Gly Ser Ser Cys He Tyr Pro Lys 
100 105 110 115 



ttt get cct cag cca att cct gag tct get ttg tta aca gca teg ctt 441 
Phe Ala Pro Gin Pro He Pro Glu Ser Ala Leu Leu Thr Ala Ser Leu 
120 125 130 



gaa cca act aat gag tgg tat get att get aag ate get ggg att aag 489 
Glu Pro Thr Asn Glu Trp Tyr Ala He Ala Lys He Ala Gly He Lys 
135 140 145 



act tgt cag get tat agg att cag cac gga tgg gat gca ate tct ggc 537 
Thr Cys Gin Ala Tyr Arg He Gin His Gly Trp Asp Ala He Ser Gly 
150 155 160 



atg cct act aat etc tat ggt cct aat gac aat ttc cac ccg gag tct 585 
Met Pro Thr Asn Leu Tyr Gly Pro Asn Asp Asn Phe His Pro Glu Ser 
165 170 175 



cat gtg ctt cct get ctt atg agg agg ttc cac gag gcg aaa gtg aat 633 
His Val Leu Pro Ala Leu Met Arg Arg Phe His Glu Ala Lys Val Asn 
180 185 190 195 



tgg age gga gga agt tgt ggt gtg ggg tac aag gta gtc ccg ttg gaa 681 
Trp Ser Gly Gly Ser Cys Gly Val Gly Tyr Lys Val Val Pro Leu Glu 
200 205 210 



ggg aag ttc ttg cat gtt gat gat ttg get gat get tgt gtt ttc ttg 729 
Gly Lys Phe Leu His Val Asp Asp Leu Ala Asp Ala Cys Val Phe Leu 
215 220 225 
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ctg gat egg ata cag egg ggg ttg gag cat gtt aac att gga agt ggt 777 
Leu Asp Arg lie Gin Arg Gly Leu Glu His Val Asn lie Gly Ser Gly 
230 235 240 

caa gaa gtg act att aga gag ttg get gag ttg gtg aaa gag gtt gtt 825 
Gin Glu Val Thr lie Arg Glu Leu Ala Glu Leu Val Lys Glu Val Val 
245 250 255 

ggt ttt gaa ggg aag ctt gga tgg gat tgc act aag cca gat ggc aca 873 
Gly Phe Glu Gly Lys Leu Gly Trp Asp Cys Thr Lys Pro Asp Gly Thr 
260 265 270 275 

ccg agg aaa ctt atg gac age tea aag etc gcg tct ttg ggt tgg aca 921 
Pro Arg Lys Leu Met Asp Ser Ser Lys Leu Ala Ser Leu Gly Trp Thr 
280 285 290 

cct aag gtt tct ctt aga gat ggt ctg age caa act tat gat tgg tat 969 
Pro Lys Val Ser Leu Arg Asp Gly Leu Ser Gin Thr Tyr Asp Trp Tyr 
295 300 305 

ttg aag aat gtt tgc aac cga taa gttaatggtt tctcttctca tatatacaca 1023 
Leu Lys Asn Val Cys Asn Arg 

310 315 

actattgagt ctcaggtaaa tcagcttatc accacattgt gatttaaacc tttctttgag 1083 

attcgagaat tgcttttttt tttatcaaaa ttgattcatt tagagataag acttgettet 1143 

ttatacaaca ttgtctgagg aattttaatt ttggatctcc gagtatggtc tattattagc 1203 

tctcttctat acaaattatc aaaacagttg taagaagttt caagaaaaac atttgatatc 1263 

tcactaattt ggctatcctt geaagttgea aegctaaaat gacaaataat gaattctegg 1323 

cccaatgggc ttacacaagc cttgttaaag atagcgtgaa caaaacgegg ctcactagcc 1383 

ctaacctgtc tctctttcgc ttaccttctt cttcgtcttc gttggctcag tcacttgact 1443 

tcacggcccg ctcaagctct gacacgaaac tcatttcaaa ttaatttaat aaaaccttaa 1503 

tcacaaaagg ggcaaaagca atcgccggcg attatgeett ctcctccggt geeggagacg 1563 

gttgtgagcc aacccgttcg 1583 



<210> 2 
<211> 314 
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<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ser Asp Lys Ser Ala Lys lie Phe Val Ala Gly His Arg Gly Leu 
15 10 15 

Val Gly Ser Ala lie Val Arg Lys Leu Gin Glu Gin Gly Phe Thr Asn 
20 25 30 

Leu Val Leu Lys Thr His Ala Glu Leu Asp Leu Thr Arg Gin Ala Asp 
35 40 45 

Val Glu ser Phe Phe Ser Gin Glu Lys Pro Val Tyr Val lie Leu Ala 
50 55 60 

Ala Ala Lys Val Gly Gly lie His Ala Asn Asn Thr Tyr Pro Ala Asp 
65 70 75 80 

Phe lie Gly Val Asn Leu Gin lie Gin Thr Asn Val Tie His Ser Ala 
85 90 95 

Tyr Glu His Gly Val Lys Lys Leu Leu Phe Leu Gly Ser Ser Cys lie 
100 105 110 

Tyr Pro Lys Phe Ala Pro Gin Pro lie Pro Glu Ser Ala Leu Leu Thr 
115 120 125 

Ala Ser Leu Glu Pro Thr Asn Glu Trp Tyr Ala lie Ala Lys lie Ala 
130 135 140 

Gly He Lys Thr Cys Gin Ala Tyr Arg He Gin His Gly Trp Asp Ala 
145 150 155 160 

He Ser Gly Met Pro Thr Asn Leu Tyr Gly Pro Asn Asp Asn Phe His 
165 170 175 

Pro Glu Ser His Val Leu Pro Ala Leu Met Arg Arg Phe His Glu Ala 
180 185 190 

Lys Val Asn Trp Ser Gly Gly Ser Cys Gly Val Gly Tyr Lys Val Val 
195 200 205 

Pro Leu Glu Gly Lys Phe Leu His Val Asp Asp Leu Ala Asp Ala Cys 

210 215 220 

Val Phe Leu Leu Asp Arg He Gin Arg Gly Leu Glu His Val Asn He 
225 230 235 240 



4 



WO 99/64618 



PCT/US99/11576 



Gly Ser Gly Gin 



Glu val Val Gly 
260 

Asp Gly Thr Pro 
275 

Gly Trp Thr Pro 
290 

Asp Trp Tyr Leu 
305 



Glu Val Thr lie 
245 

Phe Glu Gly Lys 



Arg Lys Leu Met 
280 

Lys Val Ser Leu 
295 

Lys Asn Val Cys 
310 



Arg Glu Leu Ala 
250 

Leu Gly Trp Asp 
265 

Asp Ser Ser Lys 



Arg Asp Gly Leu 
300 

Asn Arg 



Glu Leu Val Lys 
255 

Cys Thr Lys Pro 
270 

Leu Ala Ser Leu 
285 

Ser Gin Thr Tyr 



<210> 3 
<211> 966 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (966) 

<400> 3 

atg agt aaa caa cga gtt ttt att get ggt cat cgc ggg atg gtc ggt 48 
Met Ser Lys Gin Arg Val Phe lie Ala Gly His Arg Gly Met Val Gly 
15 10 15 

tec gec ate agg egg cag etc gaa cag cgc ggt gat gtg gaa ctg gta 96 
Ser Ala lie Arg Arg Gin Leu Glu Gin Arg Gly Asp Val Glu Leu Val 
20 25 30 

tta cgc acc cgc gac gag ctg aac ctg ctg gac age cgc gec gtg cat 144 
Leu Arg Thr Arg Asp Glu Leu Asn Leu Leu Asp Ser Arg Ala Val His 
35 40 45 

gat ttc ttt gee age gaa cgt att gac cag gtc tat ctg gcg gcg gcg 192 
Asp Phe Phe Ala Ser Glu Arg lie Asp Gin Val Tyr Leu Ala Ala Ala 
50 55 60 

aaa gtg ggc ggc att gtt gec aac aac acc tat ccg gcg gat ttc ate 240 
Lys Val Gly Gly lie Val Ala Asn Asn Thr Tyr Pro Ala Asp Phe lie 
65 70 75 B0 

tac cag aac atg atg att gag age aac ate att cac gec gcg cat cag 288 



5 



WO 99/64618 PCT/US99/1 1576 

Tyr Gin Asn Met Met lie Glu Ser Asn lie lie His Ala Ala His Gin 
85 90 95 

aac gac gtg aac aaa ctg ctg ttt etc gga teg tec tgc ate tac ccg 336 
Asn Asp Val Asn Lys Leu Leu Phe Leu Gly Ser Ser Cys lie Tyr Pro 
100 105 110 

aaa ctg gca aaa cag ccg atg gca gaa age gag ttg ttg cag ggc acg 384 
Lys Leu Ala Lys Gin Pro Met Ala Glu Ser Glu Leu Leu Gin Gly Thr 
115 120 125 

ctg gag ccg act aac gag cct tat get att gee aaa ate gec ggg ate 432 
Leu Glu Pro Thr Asn Glu Pro Tyr Ala lie Ala Lys lie Ala Gly lie 
130 135 140 

aaa ctg tgc gaa tea tac aac cgc cag tac gga cgc gat tac cgc tea 480 
Lys Leu Cys Glu Ser Tyr Asn Arg Gin Tyr Gly Arg Asp Tyr Arg Ser 
145 150 155 160 

gtc atg ccg acc aac ctg tac ggg cca cac gac aac ttc cac ccg agt 528 
Val Met Pro Thr Asn Leu Tyr Gly Pro His Asp Asn Phe His Pro Ser 
165 170 175 

aat teg cat gtg ate cca gca ttg ctg cgt cgc ttc cac gag gcg acg 576 
Asn Ser His Val lie Pro Ala Leu Leu Arg Arg Phe His Glu Ala Thr 
180 185 190 

gca cag aat gcg ccg gac gtg gtg gta tgg ggc age ggt aca ccg atg 624 
Ala Gin Asn Ala Pro Asp Val Val Val Trp Gly Ser Gly Thr Pro Met 
195 200 205 

cgc gaa ttt ctg cac gtc gat gat atg gcg gcg gcg age att cat gtc 672 
Arg Glu Phe Leu His Val Asp Asp Met Ala Ala Ala Ser lie His Val 
210 215 220 

atg gag ctg gcg cat gaa gtc tgg ctg gag aac acc cag ccg atg ttg 720 
Met Glu Leu Ala His Glu Val Trp Leu Glu Asn Thr Gin Pro Met Leu 
225 230 235 240 

teg cac att aac gtc ggc acg ggc gtt gac tgc act ate cgc gac gtg 768 
Ser His He Asn Val Gly Thr Gly Val Asp Cys Thr He Arg Asp Val 
245 250 255 

gcg caa acc ate gee aaa gtg gtg ggt tac aaa ggc egg gtg gtt ttt 816 
Ala Gin Thr He Ala Lys Val Val Gly Tyr Lys Gly Arg Val Val Phe 
260 265 270 

gat gec age aaa ccg gat ggc acg ccg cgc aaa ctg ctg gat gtg acg 864 
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Asp Ala Ser 

275 

cgc ctg cat 
Arg Leu His 
290 

ctt gcc age 
Leu Ala Ser 
305 

ggg taa 
Gly 



Lys Pro Asp 



cag ctt ggc 
Gin Leu Gly 



act tac cag 
Thr Tyr Gin 
310 



Gly Thr Pro 
280 

tgg tat cac 
Trp Tyr His 
295 

tgg ttc ctt 
Trp Phe Leu 



Arg Lys Leu 

gaa ate tea 
Glu lie Ser 
300 

gag aat caa 
Glu Asn Gin 
315 



Leu Asp Val 
285 

ctg gaa gcg 
Leu Glu Ala 



gac cgc ttt 
Asp Arg Phe 



PCT/US99/11576 

Thr 

ggg 912 
Gly 

egg 960 

Arg 

320 

966 



<210> 4 
<211> 321 
<212> PRT 

<213> Escherichia coli 
<400> 4 

Met Ser Lys Gin Arg Val Phe lie Ala Gly His Arg Gly Met Val Gly 
15 10 15 

Ser Ala lie Arg Arg Gin Leu Glu Gin Arg Gly Asp Val Glu Leu Val 
20 25 30 

Leu Arg Thr Arg Asp Glu Leu Asn Leu Leu Asp Ser Arg Ala Val His 
35 40 45 

Asp Phe Phe Ala Ser Glu Arg lie Asp Gin Val Tyr Leu Ala Ala Ala 
50 55 60 

Lys Val Gly Gly lie Val Ala Asn Asn Thr Tyr Pro Ala Asp Phe lie 
65 70 75 80 

Tyr Gin Asn Met Met lie Glu Ser Asn lie lie His Ala Ala His Gin 
85 90 95 

Asn Asp Val Asn Lys Leu Leu Phe Leu Gly Ser Ser Cys lie Tyr Pro 
100 105 110 

Lys Leu Ala Lys Gin Pro Met Ala Glu Ser Glu Leu Leu Gin Gly Thr 
115 120 125 

Leu Glu Pro Thr Asn Glu Pro Tyr Ala lie Ala Lys lie Ala Gly lie 
130 135 140 
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Lys Leu Cys Glu Ser Tyr Asn Arg Gin Tyr Gly Arg Asp Tyr Arg Ser 
145 150 155 160 

Val Met Pro Thr Asn Leu Tyr Gly Pro His Asp Asn Phe His Pro Ser 
165 170 175 

Asn Ser His Val lie Pro Ala Leu Leu Arg Arg Phe His Glu Ala Thr 
180 185 190 

Ala Gin Asn Ala Pro Asp Val Val Val Trp Gly Ser Gly Thr Pro Met 
195 200 205 

Arg Glu Phe Leu His Val Asp Asp Met Ala Ala Ala Ser He His Val 
210 215 220 

Met Glu Leu Ala His Glu Val Trp Leu Glu Asn Thr Gin Pro Met Leu 
225 230 235 240 

Ser His He Asn Val Gly Thr Gly Val Asp Cys Thr He Arg Asp Val 
245 250 255 

Ala Gin Thr He Ala Lys Val Val Gly Tyr Lys Gly Arg Val Val Phe 
260 265 270 

Asp Ala Ser Lys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp Val Thr 
275 280 285 

Arg Leu His Gin Leu Gly Trp Tyr His Glu He Ser Leu Glu Ala Gly 
290 295 300 

Leu Ala Ser Thr Tyr Gin Trp Phe Leu Glu Asn Gin Asp Arg Phe Arg 
305 310 315 320 

Gly 



<210> 5 

<211> 1340 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (75) . . (1040) 

<400> 5 

ctagaattca gcggccgctg aattctagct agaattcagc ggccgctgaa ttctagaacc 60 
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caggtgcaac 

aca ggg ggc 
Thr Gly Gly 
15 

gat gga get 
Asp Gly Ala 
30 

gac gec gat 
Asp Ala Asp 
45 

gtc caa ccc 
Val Gin Pro 



ttc egg aat 
Phe Arg Asn 

atg aac gac 
Met Asn Asp 
95 

gtg gtg tec 
Val Val Ser 
110 

ccg ata gat 
Pro lie Asp 
125 

ttt ggg tac 
Phe Gly Tyr 



tac ttc cag 
Tyr Phe Gin 



gtt ttc ggg 
Val Phe Gly 
175 



ac atg ggt 
Met Gly 

1 

tct ggg ctg 
Ser Gly Leu 



gga ctt cct 
Gly Leu Pro 

etc acg gat 
Leu Thr Asp 
50 

aca cac gtc 
Thr His Val 
65 

ate aaa tac 
lie Lys Tyr 
80 

aac gtc ctg 
Asn Val Leu 



tgc ctg tec 
Cys Leu Ser 



gag acc atg 
Glu Thr Met 
130 

teg tat gec 
Ser Tyr Ala 
145 

cag tac ggc 
Gin Tyr Gly 
160 

ccc cac gac 
Pro His Asp 



gaa ccc cag 
Glu Pro Gin 
5 

gta ggc aaa 
Val Gly Lys 
20 

gga gag gac 
Gly Glu Asp 
35 

aca gca cag 
Thr Ala Gin 



ate cat ctt 
lie His Leu 



aat ttg gac 
Asn Leu Asp 
85 

cac teg gec 
His Ser Ala 
100 

acc tgt ate 
Thr Cys lie 
115 

ate cac aat 
lie His Asn 



aag agg atg 
Lys Arg Met 



tgc acc ttc 
Cys Thr Phe 
165 

aac ttc aac 
Asn Phe Asn 
180 



gga tec atg 
Gly Ser Met 



gec ate cag 
Ala lie Gin 



tgg gtg ttt 
Trp Val Phe 
40 

acc cgc gec 
Thr Arg Ala 
55 

get gca atg 
Ala Ala Met 
70 

ttc tgg agg 
Phe Trp Arg 



ttt gag gtg 
Phe Glu Val 



ttc cct gac 
Phe Pro Asp 
120 

ggg cct ccc 
Gly Pro Pro 
135 

ate gac gtg 
lie Asp Val 
150 

acc get gtc 
Thr Ala Val 



ate gag gat 
He Glu Asp 



egg att eta 
Arg He Leu 
10 

aag gtg gta 
Lys Val Val 
25 

gtc tec tct 
Val Ser Ser 



ctg ttt gag 
Leu Phe Glu 



gtg ggg ggc 
Val Gly Gly 
75 

aaa aac gtg 
Lys Asn Val 
90 

ggg gec cgc 
Gly Ala Arg 
105 

aag acg acc 
Lys Thr Thr 



cac aac age 
His Asn Ser 



cag aac agg 
Gin Asn Arg 
155 

ate ccc acc 
He Pro Thr 
170 

ggc cac gtg 
Gly His Val 
185 



PCT/US99/I1576 

gtg 110 
Val 

gca 158 
Ala 



aaa 206 
Lys 

aag 254 
Lys 

60 

ctg 302 
Leu 



cac 350 
His 



aag 398 
Lys 

tac 446 
Tyr 

aat 494 

Asn 

140 

gec 542 
Ala 

aac 590 
Asn 



ctg 638 
Leu 
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cct ggc etc ate cac aag gtg cac ctg gec aag age age ggc teg gec 686 

Pro Gly Leu lie His Lys Val His Leu Ala Lys Ser Ser Gly Ser Ala 

190 195 200 

ctg acg gtg tgg ggt aca ggg aat ccg egg agg cag ttc ata tac teg 734 
Leu Thr Val Trp Gly Thr Gly Asn Pro Arg Arg Gin Phe lie Tyr Ser 
205 210 215 220 

ctg gac ctg gec cag etc ttt ate tgg gtc ctg egg gag tac aat gaa 782 
Leu Asp Leu Ala Gin Leu Phe lie Trp Val Leu Arg Glu Tyr Asn Glu 
225 230 235 

gtg gag ccc ate ate etc tec gtg ggc gag gaa gat gag gtc tec ate 
Val Glu Pro He He Leu Ser Val Gly Glu Glu Asp Glu Val Ser He 
240 245 250 

aag gag gca gec gag gcg gtg gtg gag gec atg gac ttc cat ggg gaa 
Lys Glu Ala Ala Glu Ala Val Val Glu Ala Met Asp Phe His Gly Glu 
255 260 265 

gtc ace ttt gat aca acc aag teg gat ggg cag ttt aag aag aca gec 
Val Thr Phe Asp Thr Thr Lys Ser Asp Gly Gin Phe Lys Lys Thr Ala 
270 275 280 

agt aac age aag ctg agg acc tac ctg ccc gac ttc egg ttc aca ccc 
ser Asn Ser Lys Leu Arg Thr Tyr Leu Pro Asp Phe Arg Phe Thr Pro 
285 290 295 300 

ttc aag cag gcg gtg aag gag acc tgt get tgg ttc act gac aac tac 
Phe Lys Gin Ala Val Lys Glu Thr Cys Ala Trp Phe Thr Asp Asn Tyr 
305 310 315 

gag cag gee egg aag tga agctggaaga caggatcagg tgccagcgga 1070 
Glu Gin Ala Arg Lys 
320 

ccatcggctg gcagagccca gcggccacca cccgtcaacc ctgecaggag ctgagggcac 1130 

cacccagcaa cctgggcctg cattccatcc gctctgcagc cccaagcatc tttccagtgg 1190 

ggcccccatt cacgttggtc ctcagggaaa ccagggtccg gggcaggccc ggcgctttgc 1250 

tccccacacc agccccctgc gcgtgtccac tctgatcctg catcccactc cctgggagcc 1310 

aataaagtgc attttcacag aaaaaaaaaa 1340 



830 



878 



926 



974 



1022 



<210> 6 
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<211> 321 
<212> PRT 
<213> Homo sapiens 

<400> 6 

Met Gly Glu Pro Gin Gly Ser Met Arg lie Leu Val Thr Gly Gly Ser 
15 10 15 

Gly Leu Val Gly Lys Ala lie Gin Lys Val Val Ala Asp Gly Ala Gly 
20 25 30 

Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu 
35 40 45 

Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr 
50 55 60 

His Val lie His Leu Ala Ala Met Val Gly Gly Leu Phe Arg Asn lie 
65 70 75 80 

Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His Met Asn Asp Asn 
85 90 95 

Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys 
100 105 110 

Leu Ser Thr Cys lie Phe Pro Asp Lys Thr Thr Tyr Pro lie Asp Glu 
115 120 125 

Thr Met lie His Asn Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser 
130 135 140 

Tyr Ala Lys Arg Met lie Asp Val Gin Asn Arg Ala Tyr Phe Gin Gin 
145 150 155 160 

Tyr Gly Cys Thr Phe Thr Ala Val lie Pro Thr Asn Val Phe Gly Pro 
165 170 175 

His Asp Asn Phe Asn lie Glu Asp Gly His Val Leu Pro Gly Leu lie 
180 185 190 

His Lys Val His Leu Ala Lys Ser Ser Gly Ser Ala Leu Thr Val Trp 
195 200 205 

Gly Thr Gly Asn Pro Arg Arg Gin Phe lie Tyr Ser Leu Asp Leu Ala 
210 215 220 

Gin Leu Phe lie Trp Val Leu Arg Glu Tyr Asn Glu Val Glu Pro lie 
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225 



230 



235 
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240 



He Leu Ser Val Gly Glu Glu Asp Glu Val Ser He Lys Glu Ala Ala 
245 250 255 

Glu Ala Val Val Glu Ala Met Asp Phe His Gly Glu Val Thr Phe Asp 
260 265 270 

Thr Thr Lys Ser Asp Gly Gin Phe Lys Lys Thr Ala Ser Asn Ser Lys 
275 280 285 

Leu Arg Thr Tyr Leu Pro Asp Phe Arg Phe Thr Pro Phe Lys Gin Ala 
290 295 300 

Val Lys Glu Thr Cys Ala Trp Phe Thr Asp Asn Tyr Glu Gin Ala Arg 
305 310 315 320 

Lys 



<210> 7 
<211> 1017 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (1017) 

<400> 7 

atg aga gtt ctg gtt acc ggt ggt age ggt tac att gga agt cat acc 48 

Met Arg Val Leu Val Thr Gly Gly Ser Gly Tyr He Gly Ser His Thr 
15 10 15 

tgt gtg caa tta ctg caa aac ggt cat gat gtc ate att ctt gat aac 96 
Cys Val Gin Leu Leu Gin Asn Gly His Asp Val He He Leu Asp Asn 
20 25 30 

etc tgt aac agt aag cgc age gta ctg cct gtt ate gag cgt tta ggc 144 
Leu Cys Asn Ser Lys Arg Ser Val Leu Pro Val He Glu Arg Leu Gly 
35 40 45 

ggc aaa cat cca acg ttt gtt gaa ggc gat att cgt aac gaa gcg ttg 192 
Gly Lys His Pro Thr Phe Val Glu Gly Asp He Arg Asn Glu Ala Leu 
50 55 60 

atg acc gag ate ctg cac gat cac get ate gac acc gtg ate cac ttc 240 
Met Thr Glu He Leu His Asp His Ala He Asp Thr Val He His Phe 
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65 70 75 80 

gcc ggg ctg aaa gcc gtg ggc gaa teg gta caa aaa ccg ctg gaa tat 288 
Ala Gly Leu Lys Ala Val Gly Glu Ser Val Gin Lys Pro Leu Glu Tyr 
85 90 95 

tac gac aac aat gtc aac ggc act ctg cgc ctg att age gcc atg cgc 336 
Tyr Asp Asn Asn Val Asn Gly Thr Leu Arg Leu He Ser Ala Met Arg 
100 105 110 

gcc get aac gtc aaa aac ttt att ttt age tec tec gcc acc gtt tat 384 
Ala Ala Asn Val Lys Asn Phe He Phe Ser Ser Ser Ala Thr Val Tyr 
115 120 125 

ggc gat cag ccc aaa att cca tac gtt gaa age ttc ccg acc ggc aca 432 
Gly Asp Gin Pro Lys He Pro Tyr Val Glu Ser Phe Pro Thr Gly Thr 
130 135 140 

ccg caa age cct tac ggc aaa age aag ctg atg gtg gaa cag ate etc 480 
Pro Gin Ser Pro Tyr Gly Lys Ser Lys Leu Met Val Glu Gin lie Leu 
145 150 155 160 

acc gat ctg caa aaa gcc cag ccg gac tgg age att gcc ctg ctg cgc 528 
Thr Asp Leu Gin Lys Ala Gin Pro Asp Trp Ser He Ala Leu Leu Arg 
165 170 175 

tac ttc aac ccg gtt ggc gcg cat ccg teg ggc gat atg ggc gaa gat 576 
Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly Asp Met Gly Glu Asp 
180 185 190 

ccg caa ggc att ccg aat aac ctg atg cca tac ate gcc cag gtt get 624 
Pro Gin Gly He Pro Asn Asn Leu Met Pro Tyr He Ala Gin Val Ala 
195 200 205 

gta ggc cgt cgc gac teg ctg gcg att ttt ggt aac gat tat ccg acc 672 
Val Gly Arg Arg Asp Ser Leu Ala He Phe Gly Asn Asp Tyr Pro Thr 
210 215 220 

gaa gat ggt act ggc gta cgc gat tac ate cac gta atg gat ctg gcg 720 
Glu Asp Gly Thr Gly Val Arg Asp Tyr He His Val Met Asp Leu Ala 
225 230 235 240 

gac ggt cac gtc gtg gcg atg gaa aaa ctg gcg aac aag cca ggc gta 768 
Asp Gly His Val Val Ala Met Glu Lys Leu Ala Asn Lys Pro Gly Val 
245 250 255 

cac ate tac aac etc ggc get ggc gta ggc aac age gtg ctg gac gtg 816 
His lie Tyr Asn Leu Gly Ala Gly Val Gly Asn Ser Val Leu Asp Val 
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270 



gtt aat gcc ttc age aaa 

Val Asn Ala Phe Ser Lys 
275 

gca ccg cgt cgc gag ggc 

Ala Pro Arg Arg Glu Gly 
290 

aaa gcc gac cgt gaa ctg 

Lys Ala Asp Arg Glu Leu 

305 310 

atg gcg cag gac acc tgg 
Met Ala Gin Asp Thr Trp 
325 

ccc gat taa 
Pro Asp 



gcc tgc ggc aaa ccg gtt 

Ala Cys Gly Lys Pro Val 
280 

gac ctt ccg gcc tac tgg 

Asp Leu Pro Ala Tyr Trp 

295 300 

aac tgg cgc gta acg cgc 
Asn Trp Arg Val Thr Arg 
315 

cac tgg cag tea cgc cat 
His Trp Gin Ser Arg His 
330 



aat tat cat ttt 864 

Asn Tyr His Phe 

285 

gcg gac gcc age 912 
Ala Asp Ala Ser 

aca etc gat gaa 960 
Thr Leu Asp Glu 
320 

cca cag gga tat 1008 
Pro Gin Gly Tyr 
335 

1017 



<210> 8 
<211> 338 
<212> PRT 

<213> Escherichia coli 
<400> 8 

Met Arg Val Leu Val Thr Gly Gly Ser Gly Tyr lie Gly Ser His Thr 
15 10 15 

Cys Val Gin Leu Leu Gin Asn Gly His Asp Val lie lie Leu Asp Asn 
20 25 30 

Leu Cys Asn Ser Lys Arg Ser Val Leu Pro Val lie Glu Arg Leu Gly 
35 40 45 

Gly Lys His Pro Thr Phe Val Glu Gly Asp lie Arg Asn Glu Ala Leu 
50 55 60 

Met Thr Glu lie Leu His Asp His Ala lie Asp Thr Val lie His Phe 

65 70 75 80 

Ala Gly Leu Lys Ala Val Gly Glu Ser Val Gin Lys Pro Leu Glu Tyr 
85 90 95 

Tyr Asp Asn Asn Val Asn Gly Thr Leu Arg Leu He Ser Ala Met Arg 
100 105 110 
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Ala Ala Asn Val Lys Asn Phe He Phe Ser Ser ser Ala Thr Val Tyr 
115 120 125 

Gly Asp Gin Pro Lys He Pro Tyr Val Glu Ser Phe Pro Thr Gly Thr 
130 135 140 

Pro Gin Ser Pro Tyr Gly Lys Ser Lys Leu Met val Glu Gin He Leu 
145 150 155 160 

Thr Asp Leu Gin Lys Ala Gin Pro Asp Trp Ser He Ala Leu Leu Arg 
165 170 175 

Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly Asp Met Gly Glu Asp 
180 185 190 

Pro Gin Gly He Pro Asn Asn Leu Met Pro Tyr He Ala Gin Val Ala 
195 200 205 

Val Gly Arg Arg Asp Ser Leu Ala He Phe Gly Asn Asp Tyr Pro Thr 
210 215 220 

Glu Asp Gly Thr Gly Val Arg Asp Tyr He His Val Met Asp Leu Ala 
225 230 235 240 

Asp Gly His Val Val Ala Met Glu Lys Leu Ala Asn Lys Pro Gly Val 
245 250 255 

His He Tyr Asn Leu Gly Ala Gly Val Gly Asn Ser Val Leu Asp Val 
260 265 270 

Val Asn Ala Phe Ser Lys Ala Cys Gly Lys Pro Val Asn Tyr His Phe 
275 280 285 

Ala Pro Arg Arg Glu Gly Asp Leu Pro Ala Tyr Trp Ala Asp Ala Ser 
290 295 300 

Lys Ala Asp Arg Glu Leu Asn Trp Arg Val Thr Arg Thr Leu Asp Glu 
305 310 315 320 

Met Ala Gin Asp Thr Trp His Trp Gin Ser Arg His Pro Gin Gly Tyr 
325 330 335 

Pro Asp 



<210> 9 
<211> 1047 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1047) 

<400> 9 

atg gca gag aag gtg ctg gta aca ggt ggg get ggc tac att ggc age 48 

Met Ala Glu Lys Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly Ser 
15 10 15 

cac acg gtg ctg gag ctg ctg gag get ggc tac ttg cct gtg gtc ate 96 
His Thr Val Leu Glu Leu Leu Glu Ala Gly Tyr Leu Pro Val Val lie 
20 25 30 

gat aac ttc cat aat gee ttc cgt gga ggg ggc tec ctg cct gag age 144 
Asp Asn Phe His Asn Ala Phe Arg Gly Gly Gly Ser Leu Pro Glu Ser 
35 40 45 

ctg egg egg gtc cag gag ctg aca ggc cgc tct gtg gag ttt gag gag 192 
Leu Arg Arg Val Gin Glu Leu Thr Gly Arg Ser Val Glu Phe Glu Glu 
50 55 60 

atg gac att ttg gac cag gga gec eta cag cgt etc ttc aaa aag tac 240 
Met Asp lie Leu Asp Gin Gly Ala Leu Gin Arg Leu Phe Lys Lys Tyr 
65 70 75 80 

age ttt atg gcg gtc ate cac ttt gcg ggg etc aag gec gtg ggc gag 288 
Ser Phe Met Ala Val lie His Phe Ala Gly Leu Lys Ala Val Gly Glu 
85 90 95 

teg gtg cag aag cct ctg gat tat tac aga gtt aac ctg acc ggg ace 336 
Ser Val Gin Lys Pro Leu Asp Tyr Tyr Arg Val Asn Leu Thr Gly Thr 
100 105 110 

ate cag ctt ctg gag ate atg aag gee cac ggg gtg aag aac ctg gtg 384 
lie Gin Leu Leu Glu lie Met Lys Ala His Gly Val Lys Asn Leu Val 
115 120 125 

ttc age age tea gee act gtg tac ggg aac ccc cag tac ctg ccc ctt 432 
Phe Ser Ser Ser Ala Thr Val Tyr Gly Asn Pro Gin Tyr Leu Pro Leu 
130 135 140 

gat gag gec cac ccc acg ggt ggt tgt acc aac cct tac ggc aag tec 480 
Asp Glu Ala His Pro Thr Gly Gly Cys Thr Asn Pro Tyr Gly Lys Ser 
145 150 155 160 
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aag ttc ttc ate gag gaa atg ate egg gac ctg tgc cag gca gac aag 528 

Lys Phe Phe lie Glu Glu Met lie Arg Asp Leu Cys Gin Ala Asp Lys 
165 170 175 



act tgg aac gta gtg ctg ctg cgc tat ttc aac ccc aca ggt gec cat 576 
Thr Trp Asn Val Val Leu Leu Arg Tyr Phe Asn Pro Thr Gly Ala His 
180 185 190 



gec tct ggc tgc att ggt gag gat ccc cag ggc ata ccc aac aac etc 624 
Ala Ser Gly Cys lie Gly Glu Asp Pro Gin Gly lie Pro Asn Asn Leu 
195 200 205 



atg cct tat gtc tec cag gtg gcg ate ggg cga egg gag gec ctg aat 672 
Met Pro Tyr Val Ser Gin Val Ala lie Gly Arg Arg Glu Ala Leu Asn 
210 215 220 



gtc ttt ggc aat gac tat gac aca gag gat ggc aca ggt gtc egg gat 720 
Val Phe Gly Asn Asp Tyr Asp Thr Glu Asp Gly Thr Gly Val Arg Asp 
225 230 235 240 



tac ate cat gtc gtg gat ctg gec aag 
Tyr lie His Val Val Asp Leu Ala Lys 
245 



ggc cac att gca gec tta agg 768 
Gly His lie Ala Ala Leu Arg 
250 255 



aag ctg aaa gaa cag tgt ggc tgc egg ate tac aac ctg ggc acg ggc 816 
Lys Leu Lys Glu Gin cys Gly Cys Arg lie Tyr Asn Leu Gly Thr Gly 
260 265 270 



aca ggc tat tea gtg ctg cag atg gtc cag get atg gag aag gec tct 864 
Thr Gly Tyr Ser Val Leu Gin Met Val Gin Ala Met Glu Lys Ala Ser 
275 280 285 



ggg aag aag ate ccg tac aag gtg gtg gca egg egg gaa ggt gat gtg 912 
Gly Lys Lys lie Pro Tyr Lys Val Val Ala Arg Arg Glu Gly Asp Val 
290 295 300 



gca gee tgt tac gec aac ccc age ctg gec caa gag gag ctg ggg tgg 960 
Ala Ala Cys Tyr Ala Asn Pro Ser Leu Ala Gin Glu Glu Leu Gly Trp 
305 310 315 320 



aca gca gee tta ggg ctg gac agg atg tgt gag gat etc tgg cgc tgg 1008 
Thr Ala Ala Leu Gly Leu Asp Arg Met Cys Glu Asp Leu Trp Arg Trp 
325 330 335 



cag aag cag aat cct tea ggc ttt ggc acg caa gec tga 1047 
Gin Lys Gin Asn Pro Ser Gly Phe Gly Thr Gin Ala 
340 345 
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<210> 10 
<211> 348 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Ala Glu Lys Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly Ser 
15 10 15 

His Thr Val Leu Glu Leu Leu Glu Ala Gly Tyr Leu Pro Val Val lie 
20 25 30 

Asp Asn Phe His Asn Ala Phe Arg Gly Gly Gly Ser Leu Pro Glu Ser 
35 40 45 

Leu Arg Arg Val Gin Glu Leu Thr Gly Arg Ser Val Glu Phe Glu Glu 
50 55 60 

Met Asp lie Leu Asp Gin Gly Ala Leu Gin Arg Leu Phe Lys Lys Tyr 
65 70 75 80 

Ser Phe Met Ala Val lie His Phe Ala Gly Leu Lys Ala Val Gly Glu 
85 90 95 

Ser Val Gin Lys Pro Leu Asp Tyr Tyr Arg Val Asn Leu Thr Gly Thr 
100 105 110 

He Gin Leu Leu Glu He Met Lys Ala His Gly Val Lys Asn Leu Val 
115 120 125 

Phe Ser Ser Ser Ala Thr Val Tyr Gly Asn Pro Gin Tyr Leu Pro Leu 
130 135 140 

Asp Glu Ala His Pro Thr Gly Gly Cys Thr Asn Pro Tyr Gly Lys ser 
145 150 155 160 

Lys Phe Phe He Glu Glu Met He Arg Asp Leu Cys Gin Ala Asp Lys 
165 170 175 

Thr Trp Asn Val Val Leu Leu Arg Tyr Phe Asn Pro Thr Gly Ala His 
180 185 190 

Ala Ser Gly Cys He Gly Glu Asp Pro Gin Gly He Pro Asn Asn Leu 
195 200 205 

Met Pro Tyr Val Ser Gin Val Ala He Gly Arg Arg Glu Ala Leu Asn 

210 215 220 
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Val Phe Gly Asn Asp Tyr Asp Thr Glu Asp Gly Thr Gly Val Arg Asp 
225 230 235 240 

Tyr lie His Val Val Asp Leu Ala Lys Gly His lie Ala Ala Leu Arg 
245 250 255 

Lys Leu Lys Glu Gin Cys Gly Cys Arg lie Tyr Asn Leu Gly Thr Gly 
260 265 270 

Thr Gly Tyr Ser Val Leu Gin Met Val Gin Ala Met Glu Lys Ala Ser 
275 280 285 

Gly Lys Lys lie Pro Tyr Lys Val Val Ala Arg Arg Glu Gly Asp Val 
290 295 300 

Ala Ala Cys Tyr Ala Asn Pro Ser Leu Ala Gin Glu Glu Leu Gly Trp 
305 310 315 320 

Thr Ala Ala Leu Gly Leu Asp Arg Met Cys Glu Asp Leu Trp Arg Trp 

325 330 335 

Gin Lys Gin Asn Pro Ser Gly Phe Gly Thr Gin Ala 
340 345 



<210> 11 
<211> 317 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : CONSENSUS 
<400> 11 

Xaa Xaa Arg Xaa Xaa Xaa Xaa Gly Xaa Xaa Gly Xaa Xaa Gly Xaa Xaa 
15 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa Xaa Xaa Ala Xaa Xaa Xaa 
50 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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65 70 75 80 

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
100 105 110 

Xaa Xaa Xaa Pro Xaa Xaa Glu Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa 
130 135 140 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
145 150 155 160 

Xaa Xaa Asn Xaa Xaa Gly Xaa His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
165 170 175 

Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
180 185 190 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Gly Xaa Xaa Xaa Arg Xaa 
195 200 205 

Xaa Xaa Xaa Xaa Xaa Asp xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
210 215 220 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
225 230 235 240 

Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
245 250 255 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa 
260 265 270 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
275 280 285 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Thr Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 
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<210> 12 
<211> 34 
<212> DNA 

<213> Escherichia coli 
<400> 12 

tagaattcag taaacaacga gtttttattg ctgg 34 

<210> 13 
<211> 32 
<212> DNA 

<213> Escherichia coli 



<210> 14 
<211> 30 
<212> DNA 

<213> Escherichia coli 
<400> 14 

ctggagtcga attcatgagt aaacaacgag 30 

<210> 15 
<211> 33 
<212> DNA 

<213> Escherichia coli 



<400> 13 



aactcgagtt acccccaaag cggtcttgat tc 



32 



<400> 15 



aactgcagtt acccccgaaa gcggtcttga ttc 



33 
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Box I Observation, whert certain daim. were found unsearchable (Continuation of item 1 of first sheet) 



This international report has not been established m respect of certain claims under Article I7(2X*) ** foUowiog 
Claims No*.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



I. Claims Nos 



2. 



□ Claims Nos.: 
because they relate to parts of the international application that do not comply with the prescribed requirements to such 



an extent that no meaningful international search can be carried out, specifically: 



3. I I Claims Nos.: 

1 1 because they are dependent claims and are not drafted m accordance with the second and third sentences of Rule 6.4(a). 



Boa II Observations where unity of invention is lacking (Continuation of item 2 of first sheep 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



As all required additional search fees were tunoty paid by the applicant, this international search report coven all searchable 



1. fx] Asallrequi 
claims. 



2. Q Aa all searchable claims could be searched withont effc« juanfymg an additional fee, ^ 
of any additional fee. 

3 | I As only some of the required additional search fees were timely paid by the applicant, this international search report covers 
— only those claims for which fee* were paid, specifically claims Nos.: 



4. I""! No required additional search fees were timely paid by the applicant Consequently, this international search report is 
' — restricted to the invention first mentioned m the claims; it is covered by claims Nos.: 



Remark on Protest Q The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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INTERNATIONAL SEARCH REPORT 



International application No. 

PCT7US99/11576 



BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This international Preliminary Examining Authority has found 2 inventions claimed in the International application 
covered by the claims indicated below: 

Group I claims 1-59 and 71. draw, to a method of producing ascorbic acid or esters thereof in a microorganism 
comprising cultunng a microorganism having a genetic modification to increase the action of an enzyme selected from 
the group consisting of hexokina*e, glucose phosphate isomerase etc. as well as a microorganism genetically mod.fied 
for producing ascorbic acid. 

Group II claims 60-70 and 72, drawn to a plant for producing ascorbic acid or esters thereof, wherein said plant has a 
genetic modification to increase the action of an enzyme selected from the group consisting of hexokinase. glucose 
phosphate isomerase etc. 

The inventions listed as Groups 1-11 do not relate to a single inventive concept because they are considered to be two 
different categories of invention and are not drawn to combination of categories (i.e. categories 1-5), specified m 37 CFR 
section l-475(b> 



Form PCT/ISA/210 (extra ibeelXJuly 1992)* 



